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BACKGROUND 

The blood is a vital part of the human circulatory system for the human body. 
Numerous cell types make up the blood tissue includmg leukocytes consisting of 
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granulocytes (neutrophils, eosinophils and basophils), and agranuloctyes ( lymphocytes, 
and monocytes), erythrocytes, platelets, as well as possibly many other undiscovered cell 
types. 

The turnover of cells in the hematopoietic system is enormous. It was reported that 
over one trillion cells, including 200 billion erythrocytes and 70 billion neutrophilic 
leukocytes, turn over each day in the human body (Ogawa 1993). 

The prior art is deficient in simple non-invasive methods to diagnose, prognose, and 
monitor progression and regression of disease and to identify markers related to one or 
more conditions. Although there has been a recent use of expression array phenotyping for 
identification and/or classification of biomarkers of disease, the source of biomarkers has 
been limited to those which are differentially expressed in tissue, thus requiring invasive 
diagnostic procedures (e.g. see Alon U, Barkai N, Notterman DA, Gish K, Ybarra S, Mack 
D, Levine AJ: Broad patterns of gene expression revealed by clustering analysis of tumor 
and normal colon tissues probed by oligonucleotide arrays. Proc Natl Acad Sci USA 1999, 
96:6745-6750; Schummer M, Ng WV, Bumgamer RB, Nelson PS, Schummer B, Bednarski 
DW, Hassell L, Baldwin RL, Karlan BY, Hood L Comparative hybridization of an array of 
21500 ovarian cDNAs for the discovery of genes overexpressed in ovarian 
carcinomas.Gene 1999, 238:375-385; van't Veer LJ, Dai H, van de Vijver MJ, He YD, Hart 
AA, Mao M, Peterse HL, van der Kooy K, Marton MJ, Witteveen AT, et al.: Gene 
expression profiling predicts clinical outcome of breast cancer. Nature 2002, 415:530-536; 

SUMMARY OF THE INVENTION 

The present invention provides minimally invasive methods to identify biomarkers 
usefixl for diagnosing a condition, and biomarkers and compositions thereof, wherein the 
biomarkers of the condition are identified fi-om a simple blood sample. Also encompassed 
25 are methods and kits utilizing said biomarkers, especially to diagnose, prognose, and 
monitor conditions, which include disease and non disease conditions.. Accordingly, 
methods of diagnosing disease, monitoring disease progression, and differentially 
diagnosing disease are provided, as well askits usefiil in diagnosing, monitoring disease 
progression and differentially diagnosing disease. 
30 The process described herein requires the use of a blood sample and is, therefore, 

minimally invasive as compared to conventional practices used to detect disease using 
tissue sample biomarkers. 
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Also disclosed are methods representative of means of identifying biomarkers 
which are differentially expressed as between two populations, a first population having a 
condition and a second population having a second condition, or not having a condition. 
The biomarkers thus identified can be used to diagnose an individual with a condition, or 
5 differentially diagnose an individual as having either a first or second condition. 

Other and further aspects, features, and advantages of the representations of 
ttie methods and products presented herein will be apparent firom the following description of 
the presently preferred embodiments. These embodiments are given for the purpose of 
disclosure. 

10 BRIEF DESCRIPTION OF THE DRAWINGS 

The above-recited features, advantages and objects of the invention, as well as 
others which will become clear, are attained and can be understood hi detail and more 
particular descriptions of the invention briefly summarized above may be had by reference 
to certain embodiments thereof which are illustrated in the appended drawings. These 
15 drawings form a part of the specification. It is to be noted, however, that the appended 
drawings illustrate preferred embodiments of the invention and therefore are not to be 
considered limiting in their scope. 

Figure 1 shows a diagrammatic representation of RNA expression profiles of Whole blood 
samples fi:om individuals having both osteoarthritis and hypertension as compared with 
20 RNA expression profiles firom mdividuals without either osteoarthritis or hypertension 
("normal'O. 

Figure 2 shows a diagrammatic representation of RNA expression profiles of Whole blood 
samples &om individuals who were identified as having both osteoarthritis and who were 
25 obese as described hereua as compared with RNA expression profiles firom individuals 
without either obesity or osteoarthritis (**normal*0. 

Figure 3 shows a diagrammatic representation of RNA expression profiles of Whole blood 
samples fi-om individuals who were identified as having both osteoarthritis and allergies as 
30 described herein as compared with RNA expression profiles firom individuals without either 
allergies or osteoarthritis ('^normal"). 
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Figure 4 shows a diagrammatic representation of RNA expression profiles of Whole blood 
samples from individuals having osteoarthritis and who were subject to systemic steroids as 
described herein as compared with KNA expression profiles firom individuals not taking 
systemic'Steroids and without osteoarthritis ("normal")- 

5 

Figure 5 shows a diagrammatic representation of RNA expression profiles of Whole blood 
samples firom individuals having hypertension as compared with KNA expression profiles 
firom samples of both non-hypertensive and normal individuals. 

10 Figure 6 shows a diagrammatic representation of KNA expression profiles of Whole blood 
samples from individuals who were identified as obese as described herein as compared 
with RNA expression profiles from normal and non-obese individuals. 



Figure 7 shows a venn diagram illustrating a summary of the analysis comparing 
15 hypertension and OA patients vs. individuals without hypertension or OA (Table 1 A), 
hypertension and OA patients vs. OA patients (Table IG), and the intersection between the 
two populations of genes (Table IH). 

Figure 8 shows a venn diagram illustrating a sunomary of the analysis comparing obesity and 
OA patients vs. individuals without obesity or OATable IB), obesity and OA patients vs. OA 
20 patients (Table 11), and the intersection between the two populations of genes (Table IT). 

Figure 9 shows a venn diagram illustrating a summary of the analysis comparing allergy and 
OA patients vs individuals without allergy or OA (Table 1 C), allergy and OA patients vs. OA 
patients (Table IK), and the intersection between the two populations of genes (Table IL). 

Figure 10 shows a venn diagram illustrating a summary of the analysis comparing systemic 
25 steroids and OA patients vs. individuals without OA and not exposed to systemic steroids 
(Table ID), systemic steroids and OA patients vs. OA patients (Table IM), and the 
intersection between the two populations of genes (Table IN). 

Figure 1 1 shows a diagrammatic representation of RNA expression profiles of Whole blood 
samples from individuals who were identified as having OA and being on of three types of 
30 systemic steroids, including hormone replacement therapy, birth control and prednisone. 



13 



wo 2004/1 12589 PCTAJS2004/020836 
Figure 12 shows a diagrammatic representation of RNA expression profiiles of Whole blood 
samples from individuals who were identified as having type 2 diabetes as described herein 
as compared with RNA expression profiles from normal and non-type 2 diabetes 
individuals. 

5 

Figure 13 shows a diagrammatic representation of RNA expression profiles of Whole 
blood samples fix)m individuals who were identified as having hyperlipidemia as described 
herein as compared with RNA expression profiles from normal and non-hyperlipidemia 
patients. 

10 

Figure 14 shows a diagranmatic representation of RNA expression profiles of Whole blood 
samples from individuals who were identified as having lung disease as described herein as 
compared with RNA expression profiles from normal and non lung disease individuals. 

15 Figure 15 shows a diagrammatic representation of RNA expression profiles of Whole 
blood samples from individuals who were identified as having bladder cancer as described 
herein as compared with RNA expression profiles from non bladder cancer individuals. 

Figure 16 shows a diagrammatic representation of RNA expression profiles of Whole 
20 blood samples from individuals who were identified as having advanced stage bladder 
cancer or early stage bladder cancer as described herein as compared with RNA expression 
profiles from non bladder cancer individuals. 

Figure 17 shows a diagrammatic representation of RNA expression profiles of Whole 
25 blood samples from individuals who were identified as having coronary artery disease 
(CAD) as described herein as compared with RNA expression profiles from non-coronary 
artery disease individuals. 

Figure 18 shows a diagrammatic representation of RNA expression profiles of Whole 
30 blood samples from individuals who were identified as having rheumatoid arthritis as 
described herein as compared with RNA expression profiles from non-rheun^itoid arthritis 
individuals. 
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Figure 19 shows a diagrammatic representation of RNA expression profiles of Whole blood 
samples fiom individuals who were identified as having depression as described herein as 
compared with RNA expression profiles from non-depression individuals. 

5 Figure 20 shows a diagrammatic representation of KNfA expression profiles of Whole blood 
samples fiom individuals who were identified as having various stages of osteoarttiritis as 
described herein as compared with RNA expression profiles from individuals without 
osteoarthritis. 

Figure 21 shows a diagrammatic representation of RNA expression profiles of Whole blood 
10 samples from individuals who were identified as having liver cancer as described herein as 
compared with RNA expression profiles from individuals not having liver cancer. 

Figure 22 shows a diagrammatic representation of RNA expression profiles of Whole blood 
samples from individuals who were identified as having schizophrenia as described herein as 
compared with RNA expression profiles from individuals not having schizophrenia. 

15 Figure 23 shows a diagrammatic representation of RNA expression profiles of Whole blood 
samples fit)m individuals who were identified as having symptomatic or asymptomatic 
Chagas' disease as described herein as compared with RNA expression profiles from 
individuals without Chagaas Disease. 

Figure 24 shows a diagrammatic representation of RNA expression profiles of Whole blood 
20 samples from individuals who were identified as having asthma and OA as compared with 
individuals having OA but not asthma . 

Figure 25 shows a diagranraiatic representation of RNA expression profiles of Whole blood 
samples from individuals who were identified as having manic depression syndrome as 
compared with those individuals who have schizophrenia. 

25 Figure 26 shows a representation of the presentation of various stages of OA in patients of 
with respect to the age group of the patients. 

Figure 27 shows RT-PCR of overexpressed genes in CAD peripheral blood cells identified 
using microarray experiments, including PBP, PF4 and F13A. 

Figure 28 shows the the "Blood Chip", a cDNA microarray slide witii 10,368 PCR products 
30 derived from peripheral blood cell cDNA Ubraries. Colors represent hybridization to probes 
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labeled mth Cy3 (green) or Cy5 (red) . Yellow spots indicate common hybri dization between 
both probes. In slide A, normal bl ood cell RNA samples were labeled with Cy3 and CAD 
blood cell RNA samples were labeled with Cy5. In slide B, Cy3 and Cy5 were switched to 
label the RNA samples. (Cluster analysis revealed distinct gene expression profiles for 
5 normal and CAD samples.) 

DETAILED DESCRIPTION: 

Disclosed herein are methods that would be understood by a person skilled in the art 
as representing means of identifying biomarkers which correlate to one or more nucleic 
acid transcripts which are differentially expressed in blood, according to a condition of 

10 interest, wherein the condition of interest includes a disease, a stage of disease, as weU as 
other non-disease conditions.Also disclosed herein is a composition comprising the 
biomarker(s) identified as such, the biomarker(s )themselves, as well as methods of using 
the biomarker(s). Such methods include using the biomarkers to diagnose an individual 
as having a condition ofinterest or a certain stage of a condition of interest, andto 

15 differentiate between two or more conditions. Products which are representative of kits 
useful in diagnosing an individual as having a a condition ofinterest are also disclosed. 

In one embodiment of the mvention, a blood sample is collected firom one or more 
individuals having a condition of interest, and RNA is isolated &om said blood sample. In 
a preferred embodiment the blood sample is whole blood without prior firactionation. In 
20 another preferred embodiment, the blood sample is peripheral blood leukocytes. In another 
preferred embodiment, the blood sample is peripheral blood mononuclear cells (PBMCs). 

Biomarkers are identified by measuring the level of one or more species of RNA 
transcripts or a synthetic nucleic acid copy (cDNA, cRNA etc.) thereof , fi-om one or more 
individuals who have a condition ofinterest or who do not have said condition ofinterest 

25 and/or who are healthy and normal. , In one embodiment, the level of one or more species 
of RNA transcripts is determined by quantitating the level of an RNA species of the 
invention. In one embodiment for example, mass spectrometry may be used to quantify 
the level of one or more species of RNA transcripts (KostCT et al., 1996; Fu et al., 1998). In 
a preferred embodiment, the level of one or more species of RNA transcripts is determined 

30 using microarray analysis. In another preferred embodiment, the level of one or more 

species of RNA transcripts is measured using quantitative RT-PCR. la accordance with the 
present invention, there may be employed other conventional molecular biology, 
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microbiology, and recombinant DNA techniques within the skill of the art in order to 
quantitatively or semi-quantitatively measure one or more species of KNA transcripts. 
Such techniques are explained fully in the literature. See, e.g., Sambrook, Fritsch & 
Maniatis, "Molecular Cloning: A Laboratory Manual (1982); "DNA Cloning: A Practical 

5 Approach," Volumes I and H (D.N. Glover ed. 1985); "Oligonucleotide Syn&esis" (M. J. 
Gait ed. 1984); 'TSTucleic Acid Hybridization" [B.D. Hames & S.J. Higgins eds. (1985)]; 
"Transcription and Translation" [B.D. Hames & S.J. Higgins eds. (1984)]; "Animal Cell 
Culture" [RJ. Freshney, ed. (1986)]; "Immobilized Cells And Enzymes" [IRL Press, 
(1986)]; B. Perbal, "A Practical Guide To Molecular Cloning" (1984). In a preferred 

10 embodiment quantitative RT-PCR may be used for the purpose of measuring/quantitating 
transcripts in blood. 

In a preferred embodiment, expression levels of one or more species of KNA 
transcripts from a population of samples having a condition of interest are compared those 
levels from a population of samples not having the condition of interestso as to identify 

15 biomarkers which are able to differentiate between the two populations, fri another 

preferred embodiment, expression levels of one or more species of RNA transcripts from a 
population of samples having a fiu:st condition of interest are compared with a those from a 
population of samples having a second condition of interest so as to identify biomarkers 
which can differentiate between said conditions. In another preferred embodiment, when 

20 comparing two populations of individuals to identify biomarkers of a condition of interest, 
the populations are chosen such that the populations share at least one phenotype which is 
not the condition of More preferably the populations have two or more, three or more, four 
or more etc. phenotypes in common. By phenotype is meant any trait which is not the 
condition of interest, for example, in a preferred embodiment individuals within the 

25 populations being used to identify biomarkers of a condition are of a similar age, sex, body 
mass index (BMI). 

The identified biomarkers can be used to determine whether an individual has a 
condition of iaterest. As would be understood to a person skilled in the art, one can utilize 
the biomarkers identified, or combinations of the biomarkers identified, to characterize an 
30 unknown sample in accordance with "class prediction" methods as would be understood by a 
person skilled in the art. 
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The following tenns shall have the definitions set out below: 

A "cDNA" is defined as copy-DNA or complementary-DNA, and is a product of a 
reverse transcription reaction firom an mKNA transcript. "RT-PCR" refers to reverse 
transcription polymerase chain reaction and results in production of cDNAs that are 

5 complementary to the mRNA template(s). RT-PCR includes "QRT-PCR*', quantitative real 
time reverse transcription polymerase chain reaction which uses a labeling means to 
quantitate the level of mRNA transcription and can either be done using the one step or two 
step protocols for the making of cDNA and the amplification step. The labeling means can 
include SYBR® green intercolating dye; TaqMan® probes and Molecular Beacons® as 

10 well as others as would be imderstood by a person skilled in the art. 

The term "oUgonucleotide" is defined as a molecule comprised of two or more 
deoxyribonucleotides and/ or ribonucleotides, preferably more than three. Its exact size 
will depend upon many factors which, in turn, depend upon the ultimate fimction and use of 
the oUgonucleotide. The upper limit may be IS, 20, 25, 30, 40 or SO nucleotides in length. 

IS The term "primer" as used herein refers to an oligonucleotide, whether occurring naturally 
as in a purified restriction digest or produced synthetically, which is capable of acting as a 
point of initiation of synthesis when placed under conditions in which synthesis of a primer 
extension product, which is complementary to a nucleic acid strand, is induced, i.e., in the 
presence of nucleotides and an inducing agent such as a DNA polymerase and at a suitable 

20 temperature and pH. The primer may be either single-stranded or double-stranded and 
must be sufficiently long to prime the synthesis of the desired extension product in the 
presence of the inducing agent. The exact length of the primer will depend upon many 
factors, including temperature, source of primer and the method used. For example, for 
diagnostic applications, depending on the complexity of the target sequence, the 

25 oligonucleotide primer typically contains 15-25 or more nucleotides, although it may 
contain fewer nucleotides. The factors involved in determining the appropriate length of 
primer are readily known to one of ordinary skiU in the art. 

As used herein, random sequence primers refer to a composition of primers of 
random sequence, i.e. not directed towards a specific sequence. These sequences possess 
30 suflBcient nucleotides complementary to a polynucleotide tohybridize with said 

polynucleotide and the primer sequence need not reflect the exact sequence of the template. 
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"Restriction fragment length polymorphism" refers to variations in DNA sequence 
detected by variations in the length of DNA fragments generated by restriction 
endonuclease digestion. 

A standard Northern blot assay can be used to ascertain the relative amounts of 

5 naRNA in a cell or tissue obtained from plant or other tissue, in accordance with 

conventional Northern hybridization techniques known to those persons of ordinary skill in 
the art. The Northern blot uses a hybridization probe, e.g. radiolabelled cDNA, either 
containing the full-length, single stranded DNA or a fragment of that DNA sequence at 
least 20 (preferably at least 30, more preferably at least 50, and most preferably at least 100 

10 consecutive nucleotides in length). The DNA hybridization probe can be labeled by any of 
the many different methods known to those skilled in this art. The labels most commonly 
employed for these studies are radioactive elements, enzymes, chemicals which fluoresce 
when exposed to ultraviolet light, and others. A number of fluorescent materials are known 
and can be utilized as labels. These include, for example, fluorescein, rhodamine, 

15 auramine, Texas Red, AMCA blue and Lucifer Yellow. A particular detecting material is 
anti-rabbit antibody pr^ared in goats and conjugated with fluorescein through an 
isothiocyanate. Proteins can also be labeled with a radioactive element or with an enzyme. 
The radioactive label can be detected by any of the currenfly available counting procedures. 
The preferred isotope may be selected from ^H, ^"^C, ^^P, ^^S, ^^Cl, ^^Cr, ^Co, ^*Co, ^^e, 

20 ^^'l, and ^*^e. Enzyme labels are likewise useful, and can be detected by any of 

the presently utilized colorimetric, spectrophotometric, fluorospectrophotometric, 
amperometric or gasometric techniques. The enzyme is conjugated to the selected particle 
by reaction with bridging molecules such as carbodiimides, diisocyanates, glutaraldehyde 
and the like. Many enzymes which can be used in these procedures are known and can be 

25 utilized. The preferred are peroxidase, P-glucuronidase, P-D-glucosidase, P-D- 

galactosidase, urease, glucose oxidase plus peroxidase and alkaline phosphatase. U.S. 
Patent Nos. 3,654,090, 3,850,752, and 4,016,043 are referred to by way of example for their 
disclosure of alternate labeling material and methods. 

As defined herein, a "nucleic acid array" and "microarray*' refers to a plurality of 
30 unique nucleic acids (or "nucleic acid members") attached to a support where each of the 
nucleic acid members is attached to a support in a unique pre-selected region. In one 
embodiment, the nucleic acid probe attached to the surfece of the support is DNA. In a 
preferred embodiment, the nucleic acid probe attached to the surface of the support is either 
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cDNA or oligonucleotides. In another preferred embodiment, the nucleic acid probe attached 
to the surface of the support is cDNA synthesized by polymerase chain reaction (PGR). The 
term "nucleic acid", as used herem, is interchangeable with the term "polynucleotide". In 
another preferred embodiment, a "nucleic acid array" refers to a plurality of unique nucleic 
5 acids attached to nitrocellulose or other membranes used in Southern and/or Northem blotting 
techniques. 

As used herein, " an individual" refers to a human subject as well as a non-human 
subject such as a mammal, an invertebrate, a vertebrate, a rat, a horse, a dog, a cat, a cow, a 

10 chicken, a bird, a mouse, a rodent, a primate, a fish, a firog and a deer. The examples herein 
are not meant to limit the methodology of the present invention to a human subject only, as 
the instant methodology is usefiil in the fields of veterinary medicine, animal sciences and 
such. The term "individual" refers to a human subject and a non-human subject who are 
condition firee and also includes a human and a non-human subject diagnosed with one or 

15 more conditions, as defined herein. "Co-morbid individuals" or "comorbidity" or 

"individuals considered as co-morbid" are individuals who have more than one condition as 
defined herein. For example a patient diagnosed with both osteoarthritis and hypertension 
is considered to present with comorbidities. 

As used herein, "detecting" refers to determining the presence of a one or more 
20 species of RNA transcripts, for example cDNA, RNA or EST, by any method known to 
those of skill in the art or taught in numerous texts and laboratory manuals (see for 
example, Ausubel et al. Short Protocols in Molecular Biology (1995) 3rd Ed. John Wiley & 
Sons, Inc.). For example, methods of detection include but are not limited to, RNA 
fingerprinting, Northem blotting, polymerase chain reaction, ligase chain reaction, Qbeta 
25 replicase, isothermal amplification method, strand displacement amplification, transcription 
based amplification systems, nuclease protection (SI nuclease or RNAse protection assays) 
as well as methods disclosed in WO88/10315, WO89/06700, PCT/US87/00880, 
PCT/US89/01025. 

As used herein, a "condition" of the invention refers to a mode or state of 
30 being iucluding a physical, emotional, psychological or pathological state. A condition can 
be as a result of both "genetic" and/or "environmental" factors. By "genetic factors" is meant 
genetically inherited factors or characteristics inherent as a result of the genetic make up of 

20 
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the individual. By "environmental factors" is meant those factors which are not genetically 
inherited, but which are the result of exposure to intemal or external influences. In one 
embodiment of the invention, a condition is a disease as defined herein. In another 
embodiment of the invention, a condition is a stage of a disease as defined herein. In yet 
5 another embodiment of the invention, a condition is a mode or state of being which is not a 
disease. For example in one embodiment, a condition which is not a disease is a condition 
resulting firom the progression of time. A condition resulting Scorn progression of time can 
include, but is not limited to: memory loss, loss of skin elasticity, loss of muscle tone, and 
loss of sexual desire. In a finther embodiment of the invention a condition which is not a 

10 disease is a treatment A treatment can include, but is not limited to disease modifying 
treatments as well as treatments usefiil in mitigating the symptoms of disease. For example 
treatments can include drugs specific for a disease of the invention. In a preferred 
embodiment, treatments can include drugs specific for Alzheimer's, Cardiovascular disease, 
Manic Depression Syndrome, Schizophrenia, Diabetes and Osteoarthritis. For example, 

15 treatments can include but are not limited to VIOXX®, Celebrex®, NSAE)S, Cortisone, 
Visco supplement, Lipitor®, Adriamycin®, Cytoxan®, Herceptin®, Nolvadex® Avastin®, 
Erbitux®, Fluorouracil®, Largactil®, Sparine®, Vesprin®, Stelazine®, Fentazine®, 
Prolixin®, Compazine®, Tindal®, Modecate®, Moditen®, Mellarin, Serentil, Norvane, ®, 
Fluanxol®, Clopixol®, Taractan®, Depixol®, Clopixol®, Haldol®, Haldol® Decanoate, 

20 Orap®, Inapsine®, Imap®, Semap®, Loxitane®, Daxol®, lithium, anticonvulsants (for ex. 
carbamazepine) and antidepressants and Moban®. More, generally and addition, a treatment 
can include any treatment or drug described in the Compendium of Pharmaceuticals and 
Specialties, Canadian Pharmaceutical Association; 26'^ edition, June, 1991; Krogh, 
Compendium of Pharmaceuticals and Specialties, Canadian Pharmaceutical Association; 27^ 

15 edition, April, 1992. In a fiirther embodiment, a condition of the invention which is not a 
disease is a response to environmental factors including but not limited to pollution, 
environmental toxins, lead poisoning, mercury posining, exposure to genetically modified 
organisms, exposure to radioactivity, pesticides, insecticides, and cigarette smoke, alcohol, or 
exercise. In a further embodiment, a condition is a state of health. 

W As used herein, a disease of the invention includes, but is not limited to, blood 

disorder, blood lipid disease, autoimmune disease, arthritis (including osteoarthritis, 
rheumatoid arthritis, lupus, allergies, juvenile rheumatoid arthritis and the like), bone or joint 
disorder, a cardiovascular disorder (including heart failure, congenital heart disease; 
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rheumatic fever, valvular heart disease; coxpulmonale, cardiomyopathy, myocarditis, 
pericardial disease; vascular diseases such as atherosclerosis, acute myocardial infarction, 
ischemic heart disease and the Uke), obesity, respiratory disease (including asthma, 
pneumonitis, pneumonia, puhnonary infections, lung disease, bronchiectasis, tuberculosis, 

5 cystic fibrosis, interstitial lung disease, chronic bronchitis emphysema, puhnonary 
hypertension, pulmonary thromboembolism, acute respiratory distress syndrome and the 
like), hyperlipidemias, endocrine disorder, immune disorder, infectious disease, muscle 
wasting and whole body wasting disorder, neurological disorders (including migraines, 
seizures, epUepsy, cerebrovascular diseases, alzheimers, dementia, Parkinson's, ataxic 

10 disorders, motor neuron diseases, cranial nerve disorders, spinal cord disorders, meningitis 
and the like) including neurodegenerative and/or neuropsychiatric diseases and mood 
disorders (including schizophrenia, anxiety, bipolar disorder; manic depression and the like, 
skin disorder, kidney disease, scleroderma, stroke, hereditary hemorrhage telangiectasia, 
diabetes, disorders associated with diabetes (e.g., PVD), hypertension, Gaucher's disease, 

15 cystic fibrosis, sickle cell anemia, liver disease, pancreatic disease, eye, ear, nose and/or 

throat disease, diseases affecting the reproductive organs, gastrointestinal diseases (including 
diseases of the colon, diseases of the spleen, appendix, gall bladder, and others) and the like. 
For further discussion of human diseases, see Mendelian Inheritance in Man: A Catalog of 
Human Genes and Genetic Disorders by Victor A, McKusick (12th Edition (3 volume set) 

20 June 1 998, Johns Hopkins University Press, ISBN: 0801 857422) and Harrison's Principles of 
Intemal Medicme by Braunwald, Fauci, Kasper, Hauser, Longo, & Jameson (15th Edition 
2001), the entirety of which is incorporated herein. 

In another embodiment of the invention, a disease refers to an immune disorder, 
such as those associated with overexpression of a gene or expression of a mutant gene (e.g., 

25 autoimmune diseases, such as diabetes mellitus, arthritis (including rheumatoid arthritis, 
juvenile rheumatoid arthritis, osteoarthritis, psoriatic arthritis), multiple sclerosis, 
encephalomyelitis, myasthenia gravis, systemic lupus erythematosis, automimmune 
thyroiditis, dermatitis (including atopic dermatitis and eczematous dermatitis), psoriasis, 
Sjogren's Syndrome, Crohn's disease, ulcerative colitis, aphthous ulcer, iritis, 

30 conjunctivitis, keratoconjunctivitis, ulcerative cohtis, asthma, allergic asthma, cutaneous 
lupus erythematosus, scleroderma, vaginitis, proctitis, drug eruptions, leprosy reversal 
reactions, erythema nodosum leprosum, autoimmune uveitis, allergic encephalomyelitis, 
acute necrotizing hemorrhagic encephalopathy, idiopathic bilateral progressive 
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sensorineural hearing, loss, aplastic anemia, pure red cell anemia, idiopathic 
thrombocytopenia, polychondritis, Wegener's gramdomatosis, chronic active hepatitis, 
Stevens- Johnson syndrome, idiopathic sprue, lichen planus. Graves' disease, sarcoidosis, 
primary biliary cirrhosis, uveitis posterior, and interstitial lung fibrosis), graft-versus-host 
5 disease, cases of transplantation, and allergy. 

In another embodiment, a disease of the invention is a cellular proliferative and/or 
differentiative disorder that includes, but is not linodted to, cancer e.g., carcinoma, sarcoma 
or other metastatic disorders and the like. As used herein, the term "cancer" refers to cells 
having the capacity for autonomous growth, i.e., an abnomial state of condition 

10 characterized by rapidly proliferating cell growth. "Cancer" is meant to include all types of 
cancerous growths or oncogenic processes, metastatic tissues or malignantly transformed 
cells, tissues, or organs, irrespective of histopathologic type or stage of invasiveness. 
Examples of cancers include but are not limited to solid tumors and leukemias, including: 
apudoma, choristoma, brancbioma, malignant carcinoid syndrome, carcinoid heart disease, 

15 carcinoma (e.g.. Walker, basal cell, basosquamous, Brown-Pearce, ductal, Ehrlich tumour, 
in situ, Krebs 2, Merkel cell, mucinous, non-small cell lung, oat cell, papillary, scirrhous, 
bronchiolar, bronchogenic, squamous cell, and transitional cell), histiocytic disorders, 
leukaemia (e.g., B cell, mixed cell, null cell, T cell, T-cell chronic, HTLV-II-associated, 
lymphocytic acute, lymphocytic chronic, mast cell, and myeloid), histiocytosis malignant, 

20 Hodgkin disease, immunoproliferative small, non-Hodgkin lymphoma, plasmacytoma, 
reticuloendotheliosis, melanoma, chondroblastoma, chondroma, chondrosarcoma, fibroma, 
fibrosarcoma, giant cell tumors, histiocytoma, hpoma, liposarcoma, mesothelioma, 
myxoma, myxosarcoma, osteoma, osteosarcoma, Ewing sarcoma, synovioma, 
adenofibroma, adenolymphoma, carcinosarcoma, chordoma, craniopharyngioma, 

15 dysgenninoma, hamartoma, mesenchymoma, mesonephroma, myosarcoma, 

ameloblastoma, cementoma, odontoma, teratoma, thymoma, trophoblastic tumour, adeno- 
carcinoma, adenoma, cholangioma, cholesteatoma, cylindroma, cystadenocarcinoma, 
cystadenoma, granulosa cell tumour, gynandroblastoma, hepatoma, hidradenoma, islet cell 
tumour, Leydig cell tumour, papilloma, Sertoli cell tumour, theca cell tumour, leiomyoma, 

JO leiomyosarcoma, myoblastoma, mymoma, myosarcoma, rhabdomyoma, 

rhabdomyosarcoma, ependymoma, ganglioneuroma, glioma, meduUoblastoma, 
meningioma, neurilenunoma, neuroblastoma, neuroepithelioma, neurofibroma, nexuroma, 
paraganglioma, paraganglioma nonchromaffin, angiokeratoma, angiolymphoid hyperplasia 
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with eosinophilia, angioma sclerosing, angiomatosis, glomangioma, 
hemangioendothelioma, hemangioma, hemangiopericytoma, hemangiosarcoma, 
lymphangioma, lymphangiomyoma, lymphangiosarcoma, pinealoma, carcinosarcoma, 
chondrosarcoma, cystosarcoma, phyllodes, fibrosarcoma, hemangiosarcoma, 

5 leimyosarcoma, leukosarcoma, liposarcoma, lymphangiosarcoma, myosarcoma, 

myxosarcoma, ovarian carcinoma, rhabdomyosarcoma, sarcoma (e.g., Ewing, experimental, 
Kaposi, and mast cell),. neoplasms (e.g., bone, breast, digestive system, colorectal, liver, 
pancreatic, pituitary, testicular, orbital, head and neck, central nervous system, acoustic, 
pelvic respiratory tract, and urogenital), neurofibromatosis, and cervical dysplasia, and 

10 other conditions in which cells have become immortalized or transformed. 

"Cardiovascular Disease" is defined herein as any disease or disorder of the cardiovascular 
system and includes arteriosclerosis, heart valve disease, arrhythmia, and orthostatic 
hypotension, shock, endocarditis, diseases of the aorta and its branches, disorders of the 
peripheral vascular system, and congenital heart disease.as a disease affecting the heart or 
15 blood vessels. Cardiovascular diseases include coronary artery disease, hearart failure, and 
hypertension. 

As used herein ^^Neurological Disease" is defined as a disorder of the nervous 
system, and include disorders that involve the central nervous system (brain, brainstem and 
cerebellum), the peripheral nervous system (including cranial nerves), and the autonomic 
20 nervous system (parts of which are located in both central and peripheral nervous system). 
In particular neurological disease includes alzheimers% schizophrenia, and manic 
depression syndrome. 

As used herein, a "population" or a "population of individuals" of the invention 
25 refers to a population of two or more individuals wherein the individuals have at least a 
single condition of interest in common. A population of the invention can also have two or 
more conditions in common. A population of the invention can also be comprised of two or 
more individuals who do not have a condition of interest. 

30 As used herein, "diagnosis" refers to the ability to demonstrate an increased 

likelihood that an individual has a specific condition or conditions. Diagnosis also refers to 
the ability to demonstrate an increased likelihood that an individual does not have a specific 
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condition. More particularly "diagnosis" refers to the ability to demonstrate an increased 
likelihood that an individual has one condition as compared to a second condition. More 
particularly "diagnosis'' refers to a process vdiereby there is an increased likelihood that an 
individual is properly characterized as having a condition (*true positive") or is properly 
5 characterized as not having a condition ("true negative") while minimizing the likelihood 
that the individual is improperly characterized with said condition ("false positive") or 
improperly characterized as not being afflicted with said condition ("false negative"). 

As used herein, '^eatment" refers to the administration of a drug, pharmaceutical, 
10 nutraceutical, or other form of ther^eutic regime which has the potential to reverse or 

ameliorate the pathology of a disease condition, produce a change in a condition as measured 
by either the lessening of the number or severity of symptoms or effects of the condition, as 
determined by a physician. In a preferred embodiment a treatment of the invention is a 
treatment for a disease. In another preferred embodiment, a treatment of the invention is a 
15 treatment of a disease selected from the group of: liver cancer, urinary bladder cancer, 
gallbladder cancer, brain cancer, prostate cancer, ovarian cancer, cervical cancer, kidney 
cancer, gastric cancer, colon canc^, lung cancer, breast cancer, nasopharyngeal cancer, 
pancreatic cancer, osteoarthritis, depression, hypertension, heart failure, obesity, rheumatoid 
arthritis, hyperlipidemia, lung disease, Chagas' disease, allergies, schizophrenia and asthma, 
20 manic depression syndrome, ankylosing spondylitis, guillain baire syndrome, fibromyalgia, 
multiple sclerosis, muscular dystrophy, septic joint arthroplasty, hepatitis, Crohn's disease or 
colitis, or malignant hyperthermia susceptibility, psoriasis, thyroid disorder, irritable bowel 
syndrome, osteoporosis, migraines, eczema, or a heart muimer. 

25 As used herein, a "response to treatment" indicates a physiological change as a 

result of the "application of treatment" to a condition where "treatment" includes 
pharmaceuticals, neutraceuticals, and other drugs or treatment regimes. The relative 
success of a response to treatment is determined by a physician. As used herein, by the 
term "treatment regime" is meant a course of treatment ranging fi-om a single application or 

30 dose to multiple applications of one or more doses over time. 

As used herein, a "biomarker" is a molecule which corresponds to a species of a 
nucleic acid transcript that has a quantitatively diSerential concentration or level in 
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bloodwith respect to an aspect of the condition of interest. As such, a biomarker includes a 
synthetic nucleic acid copolymer thereof, including cRNA, cDNA, and the like A species of a 
nucleic acid transcript includes any nucleic acid transcript which is transcribed from any part 
of the individual's chromosomal and extrachromosomal genome including for example the 
5 mitochondrial genome. Preferably a species of a nucleic acid transcript is an RNA transcript, 
preferably the RNA transcript includes a primary transcript, a spliced transcript, an 
alternatively spliced transcript, or an mRNA. An aspect of the condition of interest includes 
the presence or absence of the condition in an individual or group of individuals for which the 
biomarker is identified or assayed, and also includes the stage of progression or regression of 

10 a condition including a disease condition. For example, a biomarker is a molecule which 
corresponds to a species of an RNA transcript which is present at an increased level s or a 
decreased level of in the blood of an individual or a population of individuals having at 
least one condition of interest, when compared to the level of said transcript in the blood firom 
a population of individuals not having said condition of mterest.. Molecules encompassed by 

15 the term biomarker includeESTs, cDNAs, primers, etc, A biomarker can be used either 
solely or in conjunction with one or more other identified biomarkers, so as to allow 
diagnosis of a condition of interest as defined herein. 

As used herein, the term " concentration or level" of a species of an RNA 
transcript refers to the measurable quantity of a given biomarker. The " " concentration or 

20 lever " of a species of an RNA transcript can be determined by measuring the level of RNA 
using semi-quantitative methods such as microarray hybridization or more quantitative 
measurements such as quantitative real-time RT-PCR which corresponds in direct proportion 
with the extent to which the gene is expressed. The "concentration or level" ofa species of 
an RNA transcript is determined by methods well known in the art. As used herein the term 

25 "differential expression" refers to a difference in the level of expression of a species of an 
RNA nucleic acid transcript, as measured by the amount or level of RNA or can also include 
a measuremrat of the protein encoded by the gene corresponding to the nucleic acid 
transcript, in a sample or population of samples as compared with the amount or level of 
RNA or protein expression of the same nucleic acid transcript in a second sample or second 

30 population of samples. The term "differentially expressed" or "changes in the level of 
expression" refers to an increase or decrease in the measurable expression level of a given 
biomarker in a sample as compared with the measurable expression level of a given 
biomarkerin a second sample. The term "differentially expressed" or "changes in the level of 
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expression" can also refer to an increase or decrea3e in the measurable expression level of a 
given biomarkerin a population of samples as compared with the measurable expression 
level of a biomarkerin a second population of samples. As used herein, "differentially 
expressed" when referring to a single sample can be measured using the ratio of the level of 
5 expression of a given nbiomarker in said sample as compared with the mean expression level 
of the given biomaiker of a control population wherein the ratio is not equal to 1.0. 
DiflFerentially expressed can also be used to include comparing a first population of samples 
as compared with a second population of samples or a single sample to a population of 
samples using either a ratio of the level of expression or using p-value. When using p-value, 
10 a nucleic acid transcript is identified as being differentially expressed as between a first and 
second population when the p-value is less than 0.1. More preferably the p-value is less than 
0.05. Even more preferably the p-value is less than 0.01. More preferably still the p-value is 
less than 0.005. Mostpreferably the p-value is less than 0.001. When deternuning whether a 
nucleic acid transcript is differentially expressed on the basis of the ratio of the level of 
15 expression, a nucleic acid transcript is differentially expressed if the ratio of the level of 
expression of a nucleic acid transcript in a first sample as compared with a second sample is 
greater than or less than 1.0. For example, a ratio of greater than 1.2, 1.5, 1.7, 2, 3, 4, 10, 
20oraratiolessthanl,forexample0.8, 0.6,0.4, 0,2, 0.1. 0.05. In another embodiment of 
the invention a nucleic acid transcript is differentially expressed if the ratio of the mean of the 
20 level of expression of a first population as compared with the mean level of expression of the 
second population is greater than or less than 1.0 For example, a ratio of greater than 1 .2, 1.5, 
1.7, 2, 3, 4, 10, 20 or a ratio less than 1, for example 0.8, 0.6, 0.4, 0.2, 0.1. O.OSIn anotiier 
embodiment of the invention a nucleic acid transcript is differentially expressed if the ratio of 
its level of expression in a first sample as compared with the mean of the second population 
25 is greater than or less than 1 ,0 and includes for example, a ratio of greater than 1 .2, 1 .5, 1 .7, 
2, 3, 4, 10, 20, or a ratio less tiian 1, for example 0.8, 0.6, 0.4, 0.2, 0.1. 0.05. "Differentially 
increased expression" refers to 1.1 fold, 1.2 fold, 1.4 fold, 1.6 fold, 1.8 fold, or more, relative 
to a standard, such as the mean of the expresion level of the second population. 
"Differentially decreased expression" refers to less than 1.0 fold, 0.8 fold, 0.6 fold, 0.4 fold, 
30 0.2 fold, 0. 1 fold or less, relative to a standard, such as the mean of the expresion level of the 
second population.. 

A nucleic acid transcript is also said to be differentially expressed in two samples if 
one of the two samples contains no detectable expression of the nucleic acid transcript. 
Absolute quantification of the level of expression of a nucleic acid transcriptcan be 
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accomplished by including known concentration(s) of one or more control nucleic acid 
transcript, generating a standard curve based on the amount of the control s nucleic acid 
transcriptand extrapolating the expression level of the '^unknown" nucleic acid transcript, for 
example, from the real-time RT PGR hybridization intensities of the unknown with respect to 
5 the standard curve. 

By a ntlcleic acid transcript that is "expressed in blood" is meant a nucleic acid 
transcript that is expressed in one or more cells of blood, wherein the cells of blood include 
monocytes, leukocytes, lymphocytes, erythrocytes, all other cells derived directly from 
10 haemopoietic or mesenchymal stem cells, or cells derived directly from a cell which typically 
makes up the blood. 



The term "biomarker" further includes any molecule that correlates to, or is 
reflective of the transcript produced from any region of nucleic acid that can be transcribed, 

15 as the invention contemplates detection of RNA or equivalents thereof, i.e., cDNA or EST. 
A biomarker of the invention includes but is not limited to regions which are translated 
into proteins which are specific for or involved in a particular biological process, such as 
apoptosis, differentiation, stress response, aging, proliferation, etc.; cellular mechanism 
genes, e.g. cell-cycle, signal transduction, metabolism of toxic compounds, and the like; 

20 disease associated genes, e.g. genes involved in cancer, schizophrenia, diabetes, high blood 
pressure, atherosclerosis, viral-host interaction, infection and the like. A biomarker of the 
the invention includes, but is not limited to transcripts transcribed from immune response 
genes. A gene of the invention is a biomarker of a condition and can be a biomarker of 
disease, or a biomarker of a non disease condition as defined herein. 

25 For example, a biomolecule can be reflective of or correlate to the transcript from any 

gene , includingan oncogene (Hanahan, D. and R.A. Weinberg, Cell (2000) 100:57; and 
Yokota, J., Carcinogenesis (2000) 21(3):497-503) whose expression within a cell induces that 
cell to become converted from a normal cell into a tumor cell. Examples of genes which 
produce transcript(s)'to which a biomarker is correlated to or reflective of, include, but are 

30 not limited to, include cytokine genes (Rubinstein, M., et al., Cytokine Growth Factor Rev. 
(1998) 9(2):175-81); idiotype (Id) protein genes (Benezra, R., et al., Oncogene (2001) 
20(58):8334-41; Norton, J.D., J. Cell Sci. (2000) 113(22):3897-905); prion genes (Prusiner, 
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S.B., et al., CeU (1998) 93(3):337-48; Safar, J., and S.B. Prusiner, Prog. Brain Res. (1998) 
117:421-34); genes that express molecules that induce angiogenesis (Gould, V.E. and B.M. 
Wagaer, Hum. Pathol. (2002) 33(ll):1061-3); genes encoding adhesion molecules (Chothia, 
C. and E.V. Jones, Annu. Rev. Biochem. (1997) 66:823-62; Parise, L.V., et al., Semin. 
5 CancCT Biol. (2000) 10(6):407-14); genes encoding cell surface receptors (Deller, M.C., and 
Y.E. Jones, Curr. Opin. Struct. Biol. (2000) 10(2):213-9); genes of protems that are involved 
in metastasizing and/or invasive processes (Boyd, D,, Cancer Metastasis Rev. (1996) 
15(l):77-89; Yokota, J., Carcinogenesis (2000) 21(3):497-503); genes of proteases as well as 
of molecules that regulate apoptosis and the cell cycle (Matrisian, L.M., Curr. Biol. (1999) 
0 9(20):R776-8; Krepela, E., Neoplasma (2001) 48(5):332-49; Basbaum and Werb, Cuir. Opin. 
Cell Biol. (1996) 8:731-738;'Birkedal-Hansen, et al., Crit Rev. Oral Biol. Med. (1993) 
4:197-250; Mignatti and Rifldn, Physiol. Rev. (1993) 73:161-195; Stetler-Stevenson, et al., 
Annu. Rev. Cell Biol. (1993) 9:541-573; Brinkerhoff, E., and L.M. Matrisan, Nature Reviews 
(2002) 3:207-214; Strasser, A., et al, Annu. Rev. Biochem. (2000) 69:217-45; Chao, D.T. 

L5 and S J. Korsmeyer, Annu. Rev. Immunol. (1998) 16:395-419; Mullauer, L., et al., Mutat 
Res. (2001) 488(3):21 1-31; Fotedar, R., et al., Prog. Cell Cycle Res. (1996) 2:147-63; Reed, 
J.C., Am. J. Pathol. (2000) 157(5):1415-30; D'Ari, R., Bioassays (2001) 23(7):563^5); or 
multi-drug resistance genes, suach as MDRl gene (Childs, S., and V. Ling, Imp. Adv. Oncol. 
(1994) 21-36). In another embodiment, a gene which produces transcript(s) to which a 

20 biomarker is correlated to or reflective of, include, but ard not limited to, an immune 
response gene or a non-immune response gene. By an immune response gene is meant a 
primary defense response gene located outside the major histocompatibility region (MHC) 
that is initially triggered in response to a foreign antigen to regulate immune responsiveness. 
All other genes expressed in blood are considered to be non-inamune response genes. For 

25 example, an immune response gene would be understood by a person skilled in the art to 
include: cytokines including interleukins and interferons such as TNF-alpha, BL-IO, IL-12, 
IL-2, ILr4, IL-10, IL-12, IL-13, TGF-Beta, IFN-ganam^^ immunoglobulins, complement and 
the like (see for example Bellardelli, F. Role of interferons and other cytokines in the 
regulation of the immune response APMIS. 1995 Mar; 103(3): 1 61-79; ); . 

30 

Construction of a Nucleic Acid Array 

A nucleic acid microarray (KNA, DNA, cDNA, PCR products or ESTs) according to 
fte invention can be constructed as follows: 
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Nucleic acids (RNA, DNA, cDNA, PCR products or ESTs) (^Opl) are precipitated 
with 4^1 (1/10 volume) of 3M sodium acetate (pH 5.2) and 100 ul (2.5 volumes) of ethanol 
and stored overnight at -20°C. They are then centrifuged at 3,300 ipm at 4*C for 1 hour. 
The obtained pellets are washed with 50nl ice-cold 70% ethanol and centrifuged again for 30 
5 minutes. The pellets are ihea air-dried and resuspended well in 50% dimethylsulfoxide 
(DMSO) or 20id 3X SSC overnight The samples are then deposited either singly or in 
duplicate onto Gamma Amino Propyl Silane (Coming CMT-GAPS or CMT-GAP2, Catalog 
No. 40003, 40004) or polylysine-coated slides (Sigma Cat No. P0425) using a robotic GMS 
417 or 427 arrayer (Afifymetrix, CA). The boundaries of flie DNA spots on the microairay 
10 are marked with a diamond scriber. The invention provides for arrays where 10-20,000 
difiTerent DNAs are spotted pnto a solid support to prepare an array, and also may include 
duplicate or triplicate DNAs. 

The arrays are rehydrated by suspending the slides over a dish of warm particle fiee 
ddH20 for approximately one minute (the spots will swell slightly but not run into each other) 

15 and snap-dried on a 70-80**C inverted heating block for 3 seconds. DNA is then UV 

crosslinked to the slide (Stratagene, Stratalinker, 65nJ - set display to "650" which is 650 x 
lOOjiJ) or baked at 80°C for two to four hours. The arrays are placed in a slide rack. An 
empty slide chamber is prepared and filled with the following solution: 3.0 grams of succinic 
anhydride (Aldrich) is dissolved in 189ml of l-methyl-2-pyrrolidinone (rapid addition of 

20 reagent is crucial); immediately after the last flake of succinic anhydride dissolved, 21 .0ml of 
0.2 M sodium borate is mixed in and the solution is poured into the slide chamber. The slide 
rack is plunged rapidly and evenly in the slide chamber and vigorously shaken up and down 
for a few seconds, making sure the slides never leave the solution, and then mixed on an 
orbital shaker for 15-20 minutes. The slide rack is then gently plunged in 95°C ddH20 for 2 

>5 minutes, followed by plunging five times in 95% ethanol. The slides are then air dried by 
allowing excess ethanol to drip onto paper towels. The arrays are then stored in the slide box 
at room temperature until use. 



Nucleic acid Arrays 

10 A nucleic acid array comprises any combination of the nucleic acid sequences 

generated fi-om, or complementary to nucleic acid transcripts, or regions thereof, including 
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the species of nucleic acid transcripts present in blood . Preferably, for identifying 
biomarkers of a disease or condition of interest, one utilizes a microarray so as to minimize 
cost and time of the experiment In one embodiment, the microarray is an EST microarray 
which includes ESTs conq)lementary to genes ejcpressed in blood. A microarray according tc 

5 the invention preferably comprises between 10, 100, 500, 1000, 5000, 10,000 and 15,000 
nucleic acid members, and more preferably conq)rises at least 5000 nucleic acid members. 
The nucleic acid members are known or novel nucleic acid sequences desaibed herein, or 
any combination thereof. A microarray according to tiie invention is used to assay for 
differential levels of species of tianscripts RNA expression profilespresent in blood samples 

10 fix)m healthy patients as compared to patients with a disease. 



Microarravs 

Microarrays include those arrays which encompass transcripts which are expressed in 
the individual. In one embodiment, a microarray which enconq)asses transcripts which are 
IS expressed in humans, lii a preferred embodiment microarrays of the invention can be eitiier 
cDNA based arrays or oligonucleotide based arrays. 

Oligonucleotide Arrays 

In a preferred embodiment, the oligonucleotide based microarrays of Afiymetrix® are 
utilized. More particularly the Affymetrix® Human Grenome U133 (HG-U133) Set, 

20 consisting of two GeneChip® arrays, contains ahnost 45,000 probe sets representing more 
flian 39,000 transcripts derived from approximately 33,000 well-substantiated human genes. 
This set design uses sequences selected from GenBank®, dbEST, and RefSeq. More recently 
Affymetrix® has available tiie UBS Plus 2.0 GeneChip® which represents over 47,000 
transcripts. It is expected as more genes and transcripts are identified as a result of the human 

25 genome sequencing project, additional generations of microarrays will be developed. 

The sequence clusters were created from the UniGene database (Build 133, April 20, 
2001). They were tiien refined by analysis and comparison with a number of other publicly 
available databases including the Washington University EST trace repository and the 
University of California, Santa Cruz Golden Path human genome database (April 2001 
30 release). 
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The HG-U133A Array includes representation of the RefSeq database sequences and 
probe sets related to sequences previously represented on the Human Genome U95 Av2 
Array. The HG-U133B Array contains primarily probe sets representing EST clusters. 

The U133 Plus 2.0 Array includes all probe sets represented on the GeneChip Human 
5 Genome U133 Set (U133A and U133B). The U133 Plus 2.0 includes an additional 6,500 
genes for analysis of over 47,000 transcripts. 

cDNA Based Arrays 

15 K ChondroChip™ - The ChondroChip™ is an EST based microarray and includes 
approximately 15,000 ESTs complementy to genes also expressed in human chondrocytes. 
10 Various versions of the 1 5K ChondroChip™ were used, dependmg upon the experiment in an 
ejffort to utilize a microarray which reduced redundancy so as to increase the percentage of 
unique genes and thus encompass representation of as much of the entire genome as 
possible. 

Controls on the ChondroChip'™ - There are two types of controls used on microarrays. 
15 First, positive controls are genes whose expression level is invariant between different stages 
of investigation and are used to monitor: 

a) target DNA binding to the slide, 

b) quality of the spotting and binding processes of the target DNA onto the slide, 

c) quality of the RNA samples, and 

20 d) efficiency ofthe reverse transcription and fluorescent labeling of the probes. 

Second, negative controls are extemal controls derived from an organism unrelated to 
and therefore unlikely to cross-hybridize with the sample of interest. These are used to 
monitor for: 

a) variation in background fluorescence on the slide, and 
25 b) non-specific hybridization. 

There are currently 63 control spots on the ChondroChip™ consisting of: 

Type Ng^ 

Positive Controls: 2 
yl/fe^zDNA 12 
30 A.thalianaDNA 10 
Spotting Buffer 41 
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BloodChipTM . The "BloodChip™" can also be used. The BloodChip is a cDNA microarray 
slide with 10,368 PGR products derived ftom peripheral blood cell cDNA libraries as shown 
in Figure 24. 

30K BodyChip™ - The BodyChip™ is an EST based microarray which incorporates the 
5 unique cDNA clones from both the BloodChip™ and the ChondroChip™. The BodyChip™ 
includes coverage of over 30,000 genes. 

Identifying Biomarkers Useful in Accordance with the Invention 
Collection of Blood 

Blood is drawn according to the methods of standard phlebotomy. A blood sample 
10 useful according to the invention is a blood sample ranging in volume from as little as a 
drop of blood to 100ml, more preferably a blood sample is 10ml to 60 ml, even more 
preferably a blood sample is between 25ml to 40ml. A blood sample that is useful 
according to the invention is in an amount that is sufficient for the detection of one or more 
genes according to the invention. 

15 

In one embodiment, 30 mis of blood is isolated and stored on ice withm a K3/EDTA 
tube. In another embodiment, one can utilize tubes for storing blood which contain 
stabilizing agents such as disclosed fai US patent 6,617,170. In another embodiment the 
PAXgene™ blood RNA systemiprovided by PreAnalytiX, a Qiagen/BD company may be 
20 used to collect blood. The PAXgene™ system is standardized on convenient BD 

Vacutainer™ technology. In yet another embodiment, the Tempus™ blood RNA collection 
tubes, offered by Applied Biosystems may be used. Tempus™ collection tubes provide a 
closed evacuated plastic tube containing KNA stabilizing reagent for whole blood collection, 
processing and subsequently RNA isolation. 

25 

In a preferred embodiment, RNA is isolated from said blood sample stored on ice 
within 24 hours, more preferably within 10 hours, even more preferably within 6 hours of 
collection most preferably immediately after drawing said blood. In another preferred 
embodiment, wherem stabilizers are utilized, such as with die PAXgene™ system, RNA is 
30 isolated from said blood sample can be isolated after storage at room temperature for 2-4 
days, or isolated from a blood sample stored at 4° C for a number of weeks. 
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Ixnladan and Preparation ofRNA 
Elnod Samples 

5 ' In another aspect of the invention, a blood sample, as used herein, refers to a sample 
of whole blood without prior fractionation, a sample of subsets of blood cells, and a sample 
of specific types of blood cells. Accordingly, a blood sample includes, but is not limited to, 
whole blood without prior fractionation, peripheral blood leukocytes (PBL's), granulocytes, 
agranulocytes, T lymphocytes, B lymphocytes, monocytes, macrophages, eosinophils, 

10 neutrophils, basophils, erythrocytes, and platelets separated &om whole blood. 

Whole Blood 

In one embodiment, a blood sample of ttie mvention is whole blood without prior 
fractionation. Bv whole blood is meant blood which is unfra ctionated. Whole blood 

15 includes a drop of blood, a pinprick of blood. Whole blood also includes blood in which 
the serum or plasma is removed. Whole blood without prior fractionation can be used 
directly, or one can remove the serum or plasma and isolate RNA or mRNA from the 
remaining blood sample in accordance with methods well known in the art. The use of 
whole blood without fractionation is preferred since it avoids the costly and time-consuming 

20 need to separate out the cell types within the blood (Kimoto Kimoto Y (1998) MoL Gen. 
Genet 258:233-239; Chelly J et at. (1989). Proc, Nat Acad Set USA, 86:2617-2621; Chelly 
J et al (1988). Nature 333:858-860). In a preferred embodiment, the whole blood sample can 
have the plasma or serum removed by centriftigation, using preferably gentle centrifiigation 
at 300-800xg for five to ten minutes.„In another preferred embodiment, lysis buffer is added 

25 to the wholeblood sample without prior fractionation, prior to extraction of RNA. Lysis 
Buffer (IL) 0.6g EDTA; 1 .Og KHCO2, 8.2g NH4CI adjusted to pH 7.4 (using NaOH). Once 
mixed with lysing buffer, the sample can be centrifiiged and ttie cell pellet containing the 
RNA or mRNA extracted in accordance with methods known in the art (see for example 
Sambrook et al.) 

30 

Peripheral Blood Leukocytes fPBLs) 

In another embodiment, a blood sample of the invention is a sample of peripheral 
blood leukocytes (PBLs). Whole blood without prior fractionation is obtained from a normal 
patient or from an individual diagnosed with, or suspected of having a disease or condition. 
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according to methods of phlebotomy well known in the art. PBLs are separated from the 
remainder of the blood using methods known in the art. For example, PBLs can be separated 
using a Ficoll® gradient . 

5 In another embodiment, a blood sample of the invention is a sample of granulocytes. 

In another embodimmt, a blood sample of the invention is a sample of neutrophils, 
eosinophils, basophils or any combination thereof. In another embodiment, a blood sample 
of the invention is a sample of agranulocytes. In another embodiment, a blood sample of the 
mvention is a sample of lympocytes, monocytes or a combination thereof. In yet another 

10 embodiment, a blood sample of the invention is a sample of T lymphocytes, B lymphocytes 
or a combination thereof 

In one aspect, a whole blood sample without prior fractionation is obtained from a 
normal patient or from an individual diagaosed with, or suspected of having, a disease or 

15 condition according to methods of phlebotomy well known in the art. A whole blood sample 
without prior fractionation that is usefiil according to the invention is in an amount that is 
sufficient for the detection of one or more nucleic acid sequences according to the invention. 
In a preferred embodiment, a whole blood sample without prior fractionation is in an amotmt 
ranging from 1 |il to 100ml, more preferably 10 ^il to 50 ml, even more preferably 10 p.1 to 

20 25ml and most preferably 10 ^il to l.ml. 

Quantitation of RNA using Microarrav Analysis 

In one embodiment of the invention, the expression levels of transcripts from 
individuals or populations of individuals having a condition, or not having a condition are 
25 measured using an array. In a preferred embodiment either a cDNA based microarray or an 
oligonucleotide based microarray are used, for example, the ChondroChip™ or the 
Affymetrix GeneChip® U133A, U133B orU133 Plus version are utilized. 

Microarray hybridization experiments utilizing the Affymetrix® GeneChip® 
platforms (U133A and U133 Plus 2.0) raicroarray's are preferably performed in accordance 
30 with the Affymetrix® instructions. 
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Microarray hybridization experiments utilizing the ChondroChip™ are preferably 
performed as described below. 



Preparation of Fluorescent DNA Probe from mJtNA 

Fluorescently labeled target nucleic; acid samples are prepared for analysis with an 
5 array of the invention. 

In one embodiment of the invention, labeled cDNA is prepared for hybridization to 
the ChondroChip™ microarray using 2\xg Oligo-dT primers annealed to 2|xg of mRNA 
isolated from a blood sample of a patient in a total volume of 15pl, by heating to TO'^C for 10 
ntiin, and cooled on ice. 

10 In another embodiment of the invention, 20 ug of total RNA can be utilized for 

preparation of labeled cDNA for purposes of hybridization. 

In another embodiment of the invention, RNA can be amplified (aRNA) from either 
total RNA or mRNA. In a preferred embodiment aRNA is made from total RNA. Total 
RNA is extracted with TRIzol as stated previously. 0.1 0.5 ug total RNA from each sample 
15 is then subjected to RNA amplification using RNA Amplification Kit (Arcturus, Catalog 
#KIT0201) following the user guide. 2.5 ug amplified RNA was then used for probe labeling 
by reverse transcription with 1 mM Cy3 or Cy5 (Pharmacia). The protocol used for 
hybridization was based on that described previously (H.Zhang 2002). 

20 The mRNA is reverse transcribed by incubating the sample at 42^C for 1 .5-2 hours in 

a 100 |xl volume containing a fmal concentration of 50mM Tris-HCl (pH 8.3), 75mM KCl, 
3mM MgCl2, 25mM DTT, 25mM unlabeled dNTPs, 400 units of Superscript II (200U/^L, 
Gibco BRL), and 1 5mM of Cy3 or Cy5 (Amersham). RNA is then degraded by addition of 
15p.l of O.IN NaOH, and incubation at 70DC for 10 min. The reaction mixture is neutralized 

25 by addition of 1 5^1 of O.IN HCl, and the volume is brought to 500^1 with TE (lOmM Tris, 
ImM EDTA), and 20 ug of Cotl human DNA (Gibco-BRL) is added. 

The labeled target nucleic acid sample is purified by centrifiigation in a Centricbn-30 
micro-concentrator (Amicon). If two different target nucleic acid samples (e.g., two samples 
derived from a healthy patient vs. patient with a disease) are being analyzed and compared by 
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' hybridization to the same array, each target nucleic acid sample is labeled with a different 
fluorescent label (e.g., Cy3 and Cy5) and separately concentrated. The separately 
concentrated target nucleic acid samples (Cy3 and Cy5 labeled) are combined into a fresh 
centricon, washed with 500nl TE, and concentrated again to a volume of less than 7fAl. IpL 

5 of lOng/fil polyA RNA (Sigma, #P9403) and Ijil of lOjig/^1 tRNA (Gibco-BRL, #15401- 
01 1) is added and the volume is adjusted to 9.5\i\ with distilled water. For final target nucleic 
acid preparation 2.1^11 20XSSC(1.5MNaa, 15QmMNaCitrate(pH8.0)) and0.35til 10%SDS 
is added. 

Hybridization 

10 Labeled nucleic acid is denatured by heating for 2 min at 100°C, and incubated at 

3TC for 20-30 min before being placed on a nucleic acid array imder a 22mm x 22mm glass 
cover slip. Hybridization is carried out at 65^C for 14 to 1 8 hours m a custom slide chamber 
with humidity maintained by a small reservoir of 3XSSC. The array is washed by 
submersion and agitation for 2-5 min in 2X SSC witii 0.1%SDS, followed by IX SSC, and 

15 O.IX SSC. Finally, the array is dried by centrifugation for 2 min in a slide rack in a Beckman 
GS-6 tabletop centrifuge in Microplus carriers at 650 RPM for 2 min. 

Signal Detection And Data Generation 

Following hybridization of an array with one or more labeled target nucleic acid 
samples, arrays are scanned immediately using a GMS Seamier 418 and Scanalyzer software 
20 (Michael Eisen, Stanford University), followed by GeneSpring™ software (SiUcon Genetics, 
CA) analysis. Alternatively, a GMS Scanner 428 and Jaguar software may be used followed 
by GeneSpring™ software analysis. 

If one target nucleic acid sample is analyzed, the sample is labeled with one 
fluorescent dye (e.g., Cy3 or Cy5). 

25 After hybridization to a microarray as described herein, fluorescence intensities at the 

associated nucleic acid members on the microarray are determined from images taken with a 
custom confocal microscope equipped with laser excitation sources and interference filters 
appropriate for the Cy3 or Cy5 fluorescence. 
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The presence of Cy3 or Cy5 fluorescent dye on the microairay indicates hybridization 
of a target nucleic acid and a specific nucleic acid member on the microairay. The intensity 
of Cy3 or Cy5 fluorescence represents the amount of target nucleic acid which is hybridized 
to the nucleic acid member on the microarray, and is indicative of the expression level of the 
5 specific nucleic acid member sequence in the target sample. 

After hybridization, fluorescence intensities at the associated nucleic acid members on 
the microarray are determined fi"om images taken with a custom confocal microscope 
equipped with laser excitation sources and interference filters appropriate for the Cy3 and 
Cy5 fluors. Separate scans are taken for each fluor at a resolution of 225 |im^ per pixel and 
10 65,536 gray levels. Normalization between the images is used to adjust for the different 
efficiencies in labeling and detection with the two different fluors. This is achieved by 
manual matching of the detection sensitivities to bring a set of internal control genes to nearly 
equal intensity followed by computational calculation of the residual scalar required for 
optimal intensity matching for this set of genes. 

15 The presence of Cy3 or Cy5 fluorescent dye on the microarray indicates hybridization 

of a target nucleic acid and a specific nucleic acid member on the microarray. The intensities 
of Cy3 or Cy5 fluorescence represent the amount of target nucleic acid which is hybridized to 
the nucleic acid member on the microarray, and is indicative of the expression level of the 
specific nucleic acid member sequence in the target sample. If a nucleic acid member on the 

20 array shows no color, it indicates that the element is not expressed in sufficient levels to be 
detected in either sample. If a nucleic acid member on the array shows a single color, it 
indicates that a labeled gene is expressed only in that cell sample. The appearance of both 
colors indicates that the gene is expressed in both tissue samples. The ratios of Cy3 and CyS 
fluorescence mtensities, after normalization, are indicative of differences of expression levels 

25 of the associated nucleic acid member sequence in the two samples for comparison. A ratio of 
expression not equal to 1.0 is used as an indication of differential gene expression. 

The array is scaimed in the Cy 3 and Cy5 channels and stored as separate 16-bit TIFF 
images. The images are incorporated and analyzed using Scanalyzer™ software which 
includes a gridding process to capture the hybridization intensity data from each spot on the 
30 array. The fluorescence intensity and background-subtracted hybridization intensity of each 
spot is collected and a ratio of measured mean intensities of CyS to Cy3 is calculated. A 
linear regression approach is used for normalization and assumes that a scatter plot of the 
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measured Cy5 versus Cy3 intensities should have a slope of one. The average of the ratios is 
calculated and used to rescale the data and adjust the slope to one. A post-normalization 
cutoff of a ratio not equal to LO-is used to identify diflFerentially expressed genes. 

Annotation to Identify those RNA transcripts whifh are differentially expressed in blood 

5 In one aspect of the invention, Afiymetrix(g> GeneChip® platforms (U133A and U133 

Plus 2.0) microarrays are used. As would be understood by a person skilled in the art, each 
"gene ED" on an Affymetrix® microarray represents a number of oligonucleotide probe pairs 
corresponding to a region of transcribed ma, each probe pair consists of a matched and a 
mismatched oligonucleotide, wherein the matched oUgonucleotide is 100% complementary to 

10 a RNA which is transcribed in humans. The mismatched oligonucleotide is less than 100% 
complementary to a region of a gene or a region of RNA which is transcribed in himians. 
Microarrays of the invention useful for identifying biomarkers for human conditions include 
the U95 array, the U133A array, the U133B array or the U133 plus 2.0 array. As would be 
understood by a person skilled in the art, the term "gene ID" can also be termed "spot 

15 number" or "spot ID" or "probe set ID", An example of a gene ID used by Affymetrix® is 
160020_at; 1494Xat.; or 200003_s_at. 

Gene ID's are annotated by Affymetrix and the results of the annotation are available 
on the Affymetrix website at www.affvmetrix.com . As used herein "aimotation" when used 
in the context of the Affymetrix® microarray is the information which allows one to identify 
20 the expressed RNA and, if appUcable, the resulting protein translated by the expressed RNA 
which is being measured as a result of the binding of RNA to the probe pairs of the 
microarray. The annotation master table for the Afiymetrix human microarrays is disclosed 
in Table 8A. Details as to the annotation provided in Table 8A are shown below in Table 9. 
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Table 9 



A£fymetrix 


ISkChondroChip 




Probe Set ID 


CloneK) 


Afiymetrix ID for the probe or 
ChondroGene's cDNA clone 
ID 


Target Description 


Target Description 


Description of the represented 
gene 


Representative Public ID 


Accession 


Genbank (or internal in the 
case of some Affy IDs) 
database identiiier(s) for the 
represented gene 


Overlapping Transcripts 




Details of overlapping 
transcripts found in a 
chromosomal region that 
alipTi*5 with a target seauence 




Aliases 


Gene name synonyms. 


Gene Title 


Gene Title 


Name of represented gene. 


Gene Symbol 


Gene Symbol 


Official symbol of represented 
gene. 


UniGenelD 


UniGene ID 


The identifier for the UniGene 
cluster to which the 
represented gene belongs. 


Ens^bl 




Ensembl database identifier 
for the represented gene. 
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LocxisLink 


LocusLink 


LocusLink database 
identifier(s) for the 
represented gene. 


SwissProt 




SwissProt database 
identifier(s) for the 
represented gene. 


RefSeq Protein ID 


RefSeq Protein ID 


Protein Reference Sequence 
database identifier(s) for the 
represented gene. 


RefSeq Transcript ID 




Transcript Reference 
Sequence database 
identifier(s) for the 
represented gene. 



In anotiier aspect of the invention, cDNA based airays such as the ChondroChip™ are 
used. Sequences coireq)onding to EST sequences are spotted onto the microairay. 
Sequences used include ttiose previously identified using cartilage tissue library clones as 

5 outlined in H. Zhang etal.05/eoar^Anttsrfln</Cartf/flg^e (2002) 10, 950-960. The 

differentially expressed EST sequences of the microairays of the invention are annotated by 
searching against available databases, including the "nt", "nr", "est", "gss" and "htg" data 
bases available through NCBI to determine putative identities for ESTs matching to known 
genes or other ESTs. Functional characterisation of ESTs with known gene matches are 

10 made according to any known method. Preferably, differentially expressed EST sequences 
are compared to the non-redundant Genbank/EMBL/DDBJ and dbEST databases using the 
BLAST algorithm (Altschul SF, Gish W, Miller W, Myers EW, Lipman DJ. Basic local 
alignment search tool. J Mol Biol 1990;215:403-10). A minimum value of P =10""* and 
nucleotide sequence identity >95%, where the sequence identity is non-contiguous or 

15 scattered, are required for assignments of putative identities for ESTs matching to known 
genes or to other ESTs. Construction of a non-redundant list of genes represented in the EST 
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set is done with the help of Unigene, Entrez and PubMed at the National Center for 
Biotechnology Inforaiation (NCBI) web site at www.ncbi.nlm.nih.gov. 



Genes are identified Scorn ESTs according to known methods. To identify novel 
genes firom an EST sequence, the EST should preferably be at least 100 nucleotides in length, 
5 and more preferably 1 50 nucleotides in length, for annotation. Preferably, the EST exhibits 
open reading fi:ame characteristics (i.e., can encode a putative polypq)tide). 

Having identified an EST corresponding to a larger sequence, other portions of the 
larger sequence which comprises the EST can be used in assays to elucidate gene fimction, 
e.g., to isolate polypeptides encoded by the gene, to generate antibodies specifically reactive 
10 with these polypeptides, to identify binding partners of the polypeptides (receptors, Ugands, 
agonists, antagonists and the like) and/or to detect the expression of the gene (or lack thereof) 
in healthy or diseased individuals. 

In another aspect, the invention provides for nucleic acid sequences that do not 
demonstrate a "significant match" to any of the publicly known sequences in sequence 

15 databases at thq time a query is done. Longer genomic segments comprising these types of 
novel EST sequences can be identified by probing genomic libraries, while longer expressed 
sequences can be identified in cDNA libraries and/or by performing polymerase extension 
reactions (e.g., RACE) using EST sequences to derive primer sequences as is known in the 
art. Longer firagments can be mapped to particular chromosomes by FISH and other 

20 techniques and their sequences compared to known sequences in genomic and/or expressed 
sequence databases. 

The amino acid sequraces encoded by the ESTs can also be used to search databases, 
such as GenBank, S WISS-PROT, EMBL database, PIR protein database, Vecbase, or 
GenPept for the amino acid sequences of the corresponding full-length genes according to 
25 procedures well known in the art. 

Alternative methods for analysing ESTs are also available. For example, the ESTs 
may be assembled into contigs with sequence alignment, editing, and assembly programs 
such as PHRED and PHRAP (Ewing, et al., 1998, Genome Res. 3:175, incorporated herein; 
and the web site at bozeman.genome.washington.edu). Contig redundancy is reduced by 
JO clustering nonoverlapping sequence contigs using the EST clone identification number, 
which is common for the nonoverlapping 5 and 3 sequence reads for a single EST cDNA 
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clone. In one aspect, the consensus sequence jfrom each cluster is compared to the non- 
redundant Genbank/EMBL/DDBJ and dbEST databases using the BLAST algorithm with the 
help of unigene, Entrez and PubMed at the NCBI site. 

EST clones used to spot onto the ChondroChip™ have been annotated using the 
5 methods described above. Results are reported by clone name and the annotation disclosed 
in the ChondroChip™ Master Annotation Table 8B. As used herein "annotation" when used 
in the context of the ChondroChip™ allows one to identify the expressed RNA and, if 
applicable, the resulting protein translated by the expressed RNA which is being measured as 
a result of the binding of RNA to ChondroChip™ microarray. The details of the annotation 
10 shown in Table 9 above. 

Measure of level of species of transcripts in Blood Using Quantitative Real Time RT- 
PCR 

In another aspect of the invention, the level of one or more species of transcripts of 
15 the invention can be determined using quantitative methods including QRT-PCR, RNA from 
blood (either whole blood without prior fractionation, , peripheral blood leukocytes, PBMCs 
or another subfraction of blood) using quantitative reverse transcription (RT) in combination 
with the polymerase chain reaction (PGR). 

20 Total RNA, or mRNA from blood is used as a template and a primer specific to the 

transcribed portion of a gene of the invention is used to initiate reverse transcription. Primer 
design can be accomplished utilizing commercially available software (e.g. Primer Designer 
1.0, Scientific Sofware etc.)- The product of the reverse transcription is subsequently used as 
a template for PCR, 

25 PCR provides a method for rapidly amplifying a particular nucleic acid sequence by 

using multiple cycles of DNA replication catalyzed by a thermostable, DNA-dependent DNA 
polymerase to amplify the target sequence of interest. PCR requires the presence of a nucleic 
acid to be amplified, two single-stranded oligonucleotide primers flanking the sequence to be 
amplified, a DNA polymerase, deoxyribonucleoside triphosphates, a buffer and salts. 

30 

The method of PCR is well known in the art. PCR, is performed as described in 
MulUs and Faloona, 1987, Methods EnzymoL, 155: 335, herein incoiporated by reference. 



43 



wo 2004/1 12589 PCT/US2004/020836 

PGR is performed using template DNA or cDNA (at least Ifg; more usefully, 1-1000 
ng) and at least 25 pmol of oligonucleotide primers. A typical reaction mixture includes: 2[il 
of DNA, 25 pmol of oligonucleotide primer, 2.5 nl of lOD PGR buffer 1 (Perkin-Elmer, 
Foster Gity, GA), 0.4 \d of 1.25 \M dNTP, 0.15 ^1 (or 2.5 units) of Taq DNA polymerase 
5 (Perkin Elmer, Foster City, CA) and deionized water to a total volume of 25 pJ. Mineral oil 
is overlaid and the fCR is performed using a programmable thermal cycler. 

The length and temperature of each step of a PGR cycle, as well as the number of 
cycles, are adjusted according to the stringency requirements in effect. Annealing 

.0 temperature and timing are determined both by the efQciency wi& which a primer is expected 
to anneal to a template and the degree of mismatch that is to be tolerated. The ability to 
optimize the stringency of primer annealing conditions is well within the knowledge of one of 
moderate skill in the art. An annealing temperature of between 30^C and 72°C is used. 
Initial denaturation of the template molecules normally occurs at between 92°G and 99°G for 

15 4 minutes, followed by 20-40 cycles consisting of denaturation (94-99°G for 15 seconds to 1 
minute), anneaUng (temperature determined as discussed above; 1-2 minutes), and extension 
(72''C for 1 minute). The final extension step is generally carried out for 4 minutes at 72°G, 
and may be followed by an indefinite (0-24 hour) step at 4*^G. 

20 QRT-PCR which is quantitative in nature can also be performed, using either reverse 

transcription and PGR in a two step procedure, or reverse transcription combined with PGR 
in a single step protocol so as to provide a quantitative measure of the level of one or more 
species of RNA transcripts in blood. One of these techniques, for which there are 
commercially available kits such as Taqman® (Perkin Ehner, Foster Gity, GA), is perfomied 

25 with a transcript-specific antisense probe. This probe is specific for the PGR product (e.g. a 
nucleic acid fragment derived firom a gene) and is prepared with a quencher and fluorescent 
reporter probe complexed to the 5' end of the oligonucleotide. Different fluorescent markers 
are attached to different reporters, allowing for measurement of two products in one reaction. 
When Taq DNA polymerase is activated, it cleaves off the fluorescent reporters of the probe 

30 bound to the template by virtue of its 5 *-to-3 ' exonuclease activity. In the absence of the 
quenchers, the reporters now fluoresce. The color change in the reporters is proportional to 
the amount of each specific product and is measured by a fluorometer; therefore, the amount 
of each color is measured and the PCH product is quantified. The PGR reactions are 
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perfomed in 96 well plates so that samples derived from many individuals are processed and 
measured simultaneously. The Taqman® system has the additional advantage of not 
requiring gel electrophoresis and allows for quantification when used with a standard curve. 

5 A second technique useful for detecting PCR products quantitatively without is to use 

an intercolating dye such as flie connnercially available QuantiTect™ SYBR® Green PCR 
(Qiagen, Valencia California). RT-PCR is performed using SYBR® green as a fluorescent 
label which is incorporated into the PCR product during the PCR stage and produces a 
flourescense proportional to the amount of PCR product. 

10 

Both Taqman® and QuantiTect™ SYBR® systems can be used subsequent to reverse 
transcription of RNA. 

Additionally, other systems to quantitatively measure the level of one or more species 
15 of transcripts are known including Molecular Beacons® which uses a probe having a 
fluorescent molecule and a quencher molecule, the probe capable of forming a hairpin 
structure such that when in the hairpin forai, the fluorescence molecule is quenched, and 
when hybridized the flourescense increases giving a quantitative measurement of one or more 
species of RNA transcripts. 

20 

Several other techniques for detectingPCR products quantitatively without 
electrophoresis may also be used according to the invention (see for example PCR Protocols, 
A Guide to Methods and Applications^ Innis et al.. Academic Press, Inc. N.Y., (1990)). 

25 Identification of Useful Biomarkers 

Using techniques which allow comparison as to the levels of one or more species of 
RNA transcripts in blood as described herein, one can identify useful biomarkers of a 
condition. For example one can identify those biomarkers which identify differential levels 
of one or more species of transcripts as between, for example, an individual or a population 
30 of individuals having a condition and an iadividual or a population of individuals not having 
a condition. 
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When comparing two or more saiiq)les for differences, results are reported as 
statistically significant when there is only a small probability that similar results woidd have 
been observed if the tested hypothesis (i.e., the RNA transcripts are not expressed at different 
levels) were not true. A small probability can be defined as the accepted threshold level at 
5 which the results being compared are considered significantly different. The accepted lower 
threshold is set at, but not limited to, 0.05 (i.e., there is a 5% hkelihood that the results would 
be observed between two or more identical populations) such that any values determined by 
statistical means at or below this threshold are considered significant 

When comparing two or more samples for similarities, results are reported as 
10 statistically significant when there is only a small probability that similar results would have 
been observed if the tested hypothesis (i.e., the genes are not expressed at different levels) 
were not true. A small probability can be defined as the accepted threshold level at which the 
results being compared are considered significantly different. The accepted lower threshold is 
set at, but not limited to, 0.05 (i.e., there is a 5% likelihood that the results would be observed 
15 between two or more identical populations) such that any values determined by statistical 
means above this threshold are not considered significantly different and thus similar. 

Preferably the identification of biomarkers is done using statistical analysis. For 
example, the Wilcox Maim Whitney rank sum test or a standard modified t-test such as a 
permutation t-test can be used. Additionally multigroup comparisons can also be done when 

20 there are three or more reference populations. In this case one can use statistical tests such as 
ANOVA or Kruskal WaUis which can then be analyzed using a post-hoc pairwise test such 
as the t-test, the Tukey test, or the student-Newman-Keuls test. Other multiclass comparison 
tests can also be used as would be understood by a person skilled in the art. See for example 
(Sokal and Rohlf (1987) Introduction to Biostatistics 2"^ edition, WH Freeman, New York), 

25 Yeung and Bumgamer, Multiclass classification of microarray data with repeated 

measurements: application to cancer Genome Biology 2003, 4:R83; Breiman, L. (2001) 
Statistical Modeling, the two cultures Statistical Science 16(3) 199-231 which are 
incorporated herein in their entirety. 

In order to facilitate ready access, e.g. for comparison, review, recovery and/or 
30 modification, the expression profiles of patients with a condition or without a condition can 
be recorded m a database, whether in a relational database accessible by a computational 
device or other format, or a manually accessible indexed file of profiles as photographs. 
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analogue or digital imaging, readouts spreadsheets etc. Typically the database is compiled 
and maintained at a central facility, with access being available locally and/or remotely. 



As would be understood by a person skilled in the art, comparison as between the the 
level of one or more species of transcripts in blood as illustrated by an expression profileof a 

5 test mdividual suspected of having a condition of interest, with that of individuals with 
thecondition of interest, as well as an analogous comparison of expression profiles between 
individuals with a certain stage or degree of progression of a disease condition, without said 
condition, or a healthy ("normal") individual ,so as to diagnose or prognose said test 
individual can occur via expression profiles generated concurrently or non concurrently. It 

10 would be understood that a database would be usefiil to generate said comparisoa 

As additional test samples from test patients are obtained, through clinical trials, 
further mvestigation, or the like, additional data can be determined in accordance with the 
methods disclosed herein and can likewise be added to a database to provide better reference 
data for comparison of healthy and/or non-disease patients and/or certain stage or degree of 
15 progression of a disease as compared with the test patient sample. 

The ability to combine biomarkers provides an even greater potential to help 
distinguish as between two populations so as to allow diagnosis of a disease or condition, hi 
order to identify useful combitions of biomarkers, each potential combination or set of 
biomarkers are evaluated for their ability to diagnose an unknown as having, or not having a 
20 specific condition. 

The diagnosing or prognosing may thus be performed by detecting the expression 
level of one gene, two or more genes, three or more genes, four or more genes, five or more 
genes, six or more genes, seven or more genes, eight or more genes, nine or more genes, ten 
or more genes, fifteen or more genes, twenty or more genes thirty or more genes, fifty or 
25 more genes, one hundred or more genes, two hundred or more genes, three hundred or more 
genes, five hundred or more genes or all of the genes disclosed for the specific condition in 
question. 

Use of Expression Profiles for Diagnostic Purposes 
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As woxild be understood to a person skilled in the art, one can utilize sets of 
biomarkers which have been identified as statistically significant as described above in order 
to characterize an unknown sample as having said disease or not having said disease. This is 
commonly termed "class prediction". 

5 Methods that can be used for class prediction analysis have been well described and 

generally involve a training phase using samples with known classification and a testing 
phase fi-om which the algorithm generalizes firom the training data so as to predict 
classification of unknown samples (see for Example Slonim, D. (2002), Nature penetics 
Supp. Vol 32 502-8, Raychaudhuri et al. (2001) Trends Biotechnol 19: 189-193; Khan et al, 

10 (2001) Nature Med. 7 673-9.; Golub et al. (1999) Science 286: 531-7. Hastie et al. (2000) 
Genome Biol. 1(2) Research 0003.1-0003.21 all of which are incorporated herein by 
reference in their entirety ). 

Use of Expression Profiles to Predict Disease State 

15 One can also utilize sets of genes which have been identified as producing difTerential 

levels of transcripts in blood which are statistically significant as described above in order to 
predict whether an asymptomatic individual will develop symptoms of said condition or 
whether an individual with an early stage of a disease condition will develop a later stage of a 
disease condition. 

20 For example, as a result of analyzing over 780 individuals, we have surprisingly 

shown that almost all individuals in the 56 and over age group have either moderate, marked 
or severe OA, and fiurthermore.that ahnost all mdividuals in the 61 and over age group have 
either marked or severe OA only (see Figure 35 and Table 3AE) even though there remain 
approximately 50% of Canadians over the age of 65 who do not show symptoms of 

25 osteoarthritis (Statistics Canada, Canadian Connnunity Health Survey, 2000/2001). This data 
indicates that individuals with mild OA have a significantly increased chance of progressing 
to marked or severe OA as compared with individuals who do not have mild OA. 
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As a result, one can utilize the methods of class prediction analysis described herein 
in order to determine whether an individual will develop late stage OA by identifying 
individuals with early stages of OA. 

As additional samples are obtained, for example during clinical trials, their expression 
5 profiles can be determined and correlated with the relevant subject data in the database and 
likewise be recorded in said database. Algorithms as described above can be used to query 
additional samples against the existing database to further refine the predictive.detennination 
by allowing an even greater association between the prediction of OA and one or more 
species of RNA transcripts signature. 

10 The prediction of late stage OA may thus be peifonned by detecting the level of 

transcripts expressed by two or more genes, three or more genes, four or more genes, five or 
more genes, six or more genes, seven or more genes, eight or more genes, nine or more 
genes, ten or more genes, fifteen or more genes, twenty or more genes thirty or more genes, 
fifty or more genes, one hundred or more genes, two hundred or more genes, three hundred or 

15 more genes, five hundred or more genes or all of the genes disclosed for identifying mild OA. 
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Table 1 shows genes that are differentially expressed in blood samples from patients with a 
disease or patients who are co-moibid as compared to blood samples fiom healthy patients or 
5 patients without said disease, or with only one of said co-morbid diseases 

Table 1 A shows the identity of those genes that are differentially expressed in blood 
samples &om patients with osteoarthritis and hypertension as compared with noimal 
patients using the ChondroChip™ platform. 

Table IB shows the identity of those genes that are differentially expressed in blood 
10 samples from patients with osteoarthritis and obesity as compared with normal patients 
using the ChondroChip™ platform. 

Table IC shows the identity of those genes that are differentially expressed in blood 
samples from patients with osteoarthritis and allergies as compared with normal patients 
using the ChondroChip™ platform. 

15 Table ID shows the identity of those genes that are differentially expressed in blood 

samples from patients witiit)Steoarthritis and subject to systemic steroids as compared with 
normal patients using the ChondroChip™ platfonn. 

Table IE shows the identity of those genes that are differentially expressed in blood 
samples &om patients with hypertension as compared to non hypertension patients using 
20 the ChondroChip™ platfonn. 

Table IF shows the identity of those genes that are differentially expressed in blood 
samples from patients obesity as compared to non obese patients using the ChondroChip™ 
platform. 

Table IG shows the identity of those genes that are differentially expressed in blood samples 
25 from patients with hypertension and OA when compared with patients who have OA only 
wherein genes identified in Table 1 A have been removed so as to identify genes which are 
unique to hypertension. 
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Table IH shows the identity of those genes which were identified in Table 1 A which are 
shared with fliose genes differentially expressed in blood samples firom patients with 
hypertension and OA when compared with patients who have OA only. 

Table II shows the identity of those genes that are differentially expressed in blood samples 
5 fiom patients who are obese and have OA when compared with patients who have OA only 
and wherein genes identified in Table IB have been removed so as to identify genes which 
are unique to obesity. 

Table IJ shows the identify of those genes identified in Table IB which are shared with 
those genes differentially expressed in blood samples fix>m patients who are obese and have 
10 OA when compared with patients who have OA. 

Table IK shows the identity of those gen^ that are differentially expressed in blood samples 
jfrom patients with allergies and OA when compared with patients who have OA only 
wherein genes identified in Table 10 have been removed so as to identify genes which are 
unique to allergies. 

15 Table IL shows th^ identify of those genes identified in Table 30 which are shared witii 
those genes differentially expressed in blood samples firom patients with allergies and OA 
when compared with patients who have OA only. 

Table IM shows the identity of those genes that are differentially expressed in blood samples 
from patients who are on systemic steroids and have OA when compared with patients who 
20 have OA only wherem genes identified in Table ID have been removed so as to identify 
genes which are unique to patients on systemic steroids. 

Table IN shows the identify of those genes identified in Table ID which are shared wifli 
those genes differentiaUy expressed in blood samples from patients who are on systemic 
steroids and have OA when compared with patients who have OA only. 

25 Table lO shows the identity of those genes that are differentially expressed in blood firom 
patients taking either burth control, prednisone or hormone replacement therapy and 
presenting with OA using the ChondroChip™ platform. 
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Table IP shows the identity of those genes that are differentially expressed in blood 
samples firom patients with type n diabetes as compared to patients without type n diabetes 
using the ChondroChip™ platform. 

Table IQ shows the identity of those genes that are differentially expressed in blood 
5 samples jfrom patients with HyperUpidemia as compared to patients without Hyperlipidemia 
using the ChondroChip™ platfonn. 

Table IR shows the identity of those genes that are differentially expressed in blood 
samples Scorn patients with limg disease as compared to patients without lung disease using 
the ChondroChip™ platfonn. 

10 Table IS shows the identity of those genes that are differentially expressed in blood 

samples firom patients with bladder cancer as compared to patients without bladder cancer 
using the ChondroChip™ platform. 

Table IT shows the identity of those genes that are differentially expressed in blood 
samples from patients with early stage bladder cancer, late stage bladder cancer or non- 
15 bladder cancer using the ChondroChip™ platfomi. 

Table lU shows the identity of those genes that are diflFerentially expressed in blood 
samples from patients with coronary artery disease (CAD) as compared to patients not 
having CAD using the ChondroChip™ platfonn. 

Table IV shows the identity of those genes that are differentially expressed in blood 
20 samples from patients with rheumatoid arthritis as compared to patients not having 
rheumatoid arthritis using the ChondroChip™ platform . 

Table 1 W shows the identity of those genes that are differentially expressed in blood 
samples from patients with rheumatoid arthritis as compared to patients not having 
rheumatoid arthritis using the Affymetrix® platfonn . 

25 Table IX shows the identity of those genes that are differentially expressed in blood 
samples from patients with depression as compared with patients not having depression 
usmg the ChondroChip™ platfonn. 
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Table lY shows the identity of those genes that are differentially expressed in blood 
samples from patients with various stages of osteoarthritis using the ChondroChip™ 
platform. 

Table IZ shows the identity of those genes that are differentially expressed in blood 
5 samples from patients with liver cancer as compared with patients not having liver cancer 
using the Affymetrix® platform. . 

Table lAA shows the identity of those genes that are differentially expressed in blood 
samples from patimts with schizophrenia as compared with patients not having 
schizophrenia using the Afiymetrix® platform. 

10 Table 1 AB shows the identity of those genes that are differentially expressed in blood 
samples from patients with Chagas disease as compared with patients not having Chagas 
disease using the Afiymetrix® platform. 

Table lAC shows the identity of those genes that are differentially expressed in blood 
samples from patients with asthma as compared with patients not having asthma using the 
15 ChondroChip™, 

Table IAD shows the identity of those genes that are differentially expressed in blood 
samples from patients with asthma as compared with patients not having asthma using the 
Affymetrix® platfonn. 

Table lAE shows the identity of those genes that are differentially expressed in blood 
20 samples from patients with lung cancer as compared with patients not having lung cancer 
using the Affymetrix® platform. 

Table lAG shows the identity of those genes that are differentially expressed m blood 
samples from patients with hypertension as compared with patients not having hypertension 
using the Affymetrix® platfonn. 

25 Table lAH shows the identity of those genes that are differentially expressed in blood 
samples from patients with obesity as compared with patients not having obesity using the 
Affymetrix® platform. 



55 



wo 2004/1 12589 PCT/US2004/020836 

Table 1 AI shows the identity of those genes that are differentially expressed in blood 
samples from patients with ankylosing spondylitis using the Afifymetrix® platform. 

Table 2 shows the identity of those genes that are differentially expressed in blood from 
patients with either mild or severe OA, but for which genes relevant to asthma, obesity, 
5 hypertension, systemic steroids and allergies have been removed. 

Table 3 shows those genes that are differentially expressed in blood samples from patients 
with a first disease as compared to blood samples from patients with a second disease so as 
to allow differential diagnosis as between said first and second disease. 

Table 3A shows the identity of those genes that are differentially expressed in blood from 
10 patients with schizophrenia as compared with manic depression syndrome (MDS) using the 
AfEymetrix® platform. 

Table 3B shows the identity of those genes that are differentially expressed in blood from 
patients with hepatitis as compared with liver cancer using the A^metrix® platform. 

Table 3C shows the identity of those genes that are differentially expressed in blood from 
15 patients with bladder cancer as compared with liver cancer using the Afiymetrix® platform. 

Table 3D shows the identity of those genes that are dififerentialjy expressed in blood &om 
patients with bladder cancer as compared with testicular cancer using the Afifymetrix® 
platform. 

Table 3E shows the identity of those genes that are differentially expressed in blood from 
20 patients with testicular cancer as compared with kidney cancer using the Affymetrix® 
platform. 

Table 3F shows the identity of those genes that are differentially expressed in blood from 
patients with liver cancer as compared with stomach cancer using the Affymetrix® 
platform. 

25 Table 3G shows the identity of those genes that are differentially expressed in blood from 
patients with liver cancer as compared with colon cancer using the Affymetrix® platform. 
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Table 3H shows the identity of those genes that are differentially expressed in blood from 
patients with stomach cancer as compared with colon cancer using the Aflfymetrix® 
platform. 

Table 31 shows the identity of those genes that are differentially expressed in blood from 
5 patients Avith Rhexmiatoid Arthritis as compared with Osteoarthritis using the Afiymetrix® 
platform. 

Table 3K shows the identity of those genes that are differentially expressed in blood from 
patients with Chagas Disease as compared with Heart Failure using the Afifymetrix® 
platfonn. 

10 Table 3L shows the identity of those genes that are differentially expressed in blood from 
patients with Chagas Disease as compared with Coronary Artery Disease using the 
Afiymetrix® platform. 

Table 3N shows the identity of those genes that are differentially expressed in blood from 
patients with Coronary Artery Disease as compared with Heart Failure using the 
15 Afiymetrix® platform. 

Table 3P shows the identity of those genes that are differentially expressed in blood from 
patients with Asymptomatic Chagas Disease as compared with Symptomatic Chagas 
Disease using the Afiymetrix® platfonn. 

Table 3Q shows the identity of those genes that are differentially expressed in blood from 
20 patients with Alzheimer's' as compared with Schizophrenia using the Afiymetrix® 
platfonn. 

Table 3R shows the identity of those genes that are differentially expressed in blood from 
patients with Alzheimer's' as compared with Manic Depression Syndrome using the 
Afiymetrix® platform. 

25 Table 4 shows those genes that are differentially expressed in blood samples from patients 
with a stage of Osteoarthritis as compared to blood samples from patients with a second 
stage of Osteoarthritis so as to allow monitoring of progression and/or regression of 
disease. 
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Table 4A shows the identity of those genes that are differentiaUy expressed in blood iBrom 
patients with Osteoarthritis as compared with patients without Osteoarthritis using the 
ChondroChip™ platform. 

Table 4B shows the identity of those genes that are differentially expressed in blood from 
patients with Osteoarthritis as compared with patients without Osteoarthritis using the 
Asymetrix® platform. 

Table 4C shows the identity of those genes that are differentially expressed in blood from 
pati^ts with mold Osteoarthritis as compared with patients without mild Osteoarthritis 
using the ChondroChip™ platform. 

Table 4D shows the identity of those genes that are differentially expressed in blood from 
patients with mild Osteoarthritis as compared with patients without Osteoarthritis using the 
Affymetrix® platform. 

Table 4E shows the identity of those genes that are differentially expressed in blood from 
patients with moderate Osteoarthritis as compared with patients without Osteoarthritis 
using the ChondroChip'^"^ platform. 

Table 4F shows the identity of those genes that are differentially expressed in blood from 
patients with moderate Osteoarthritis as compared with patients without Osteoarthritis 
using the Affymetrix® platform. 

Table 4G shows the identity of those genes that are differentially expressed in blood from 
patirats with marked Osteoarthritis as compared with patients without Osteoarthritis using 
the ChondroChip'^'^ platform. 

Table 4H shows the identity of those genes that are differentially expressed in blood from 
patients with marked Osteoarthritis as compared with patients without Osteoarthritis using 
the Affymetrix® platform. 

Table 41 shows the identity of those genes that are differentially expressed in blood from 
patients with severe Osteoarthritis as compared with patients without Osteoarthritis using 
the ChondroChip'^'^ platform. 
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Table 4 J shows the identity of those genes that are differentially expressed in blood from 
patients with severe Osteoarthritis as compared with patients without Osteoarthritis using 
the Affymetrix® platform. 

Table 4K shows the identity of those genes that are differentially expressed in blood from 
5 patients with mild Osteoarthritis as compared with patients with moderate Osteoarthritis 
using the ChondroChip™ platform. 

Table 4L shows the identity of those genes that are differentially expressed in blood from 
patients with mild Osteoarthritis as compared with patients with moderate Osteoarthritis 
using the Afiymetrix® platform. 

10 Table 4M shows the identity of those genes that are differentially expressed in blood from 
patients with mild Osteoarthritis as compared with patients with marked Osteoardmtis 
using the ChondroChip™ platform. 

Table 4N shows the identity of those genes that are differentially expressed in blood from 
patients with mild Osteoarthritis as compared with patients with marked Osteoarthritis 
15 usmg the Affymetrix® platform. 

Table 40 shows the identity of those genes that are differentially expressed in blood from 
patients with mild Osteoarthritis as compared with patients with severe Osteoarthritis using 
the ChondroChip™ platform. 

Table 4P shows the identity of those genes that are differentially expressed in blood from 
20 patients with mild Osteoarthritis as compared with patients with severe Osteoarthritis using 
the Affymetrix® platform. 

Table 4Q shows the identity of those genes that are differentially expressed in blood from 
patients with moderate Osteoarthritis as compared with patients with marked Osteoarthritis 
using the ChondroChip™ platform. 

25 Table 4R shows the identity of those genes that are differentially expressed in blood from 
patients with moderate Osteoarthritis as compared with patients with marked Osteoarthritis 
using the Affymetrix® platform. 



59 



wo 2004/112589 PCT/US2004/020836 

Table 4S shows the identity of those genes that are differratially expressed in blood from 
patients with moderate Osteoarthritis as compared with patients with severe Osteoarthritis 
xising the ChondroChip™ platform. 

Table 4T shows the identity of those genes that are differentially expressed in blood from 
5 patients with moderate Osteoarthritis as compared with patients with severe Osteoarthritis 
using the Affymetrix® platform. 

Table 4U shows the identity of those genes that are diflferentially expressed in blood from 
patients with marked Osteoarthritis as compared with patients with severe Osteoarthritis 
using the ChondroChip™ platform. 

10 Table 4V shows the identity of those genes that are differentially expressed in blood from 
patients with marked Osteoarthritis as compared with patients with severe Osteoarthritis 
using the Affymetrix® platform. 

Table 5 shows those genes that are differentially expressed in blood samples from patients 
with a disease or condition of interest as compared to blood samples from patients without 
15 said disease or condition. 

Table 5A shows the identity of those genes that are differentially expressed in blood 
samples from patients with psoriasis as compared with patients not having hypertension 
using the Affymetrix® platform. 

Table SB shows the identity of those genes that are differentially expressed in blood 
20 samples from patients with thyroid disorder as compared with patients not havmg thyroid 
disorder using the Affymetrix® platform. 

Table 5C shows the identity of those genes that are differentially expressed in blood 
samples from patients with irritable bowel syndrome as compared with patients not having 
irritable bowel syndrome using the Affymetrix® platform. 

25 Table 5D shows the identity of those genes that are differentially expressed in blood 

samples from patients with osteoporosis as compared with patients not having osteoporosis 
usitig the Affymetrix® platform. 



60 



wo 2004/1 12589 PCT/US2004/0 20836 

Table 5E shows the identity of those genes that are differentially expressed in blood 
samples from patients with migraine headaches as compared with patients not having 
migraine headaches using the Affymetrix® platfonn. 

Table 5F shows the identity of those genes that are differentially expressed in blood 
5 samples from patients with eczema as compared with patients not having eczema using the 
Affymetrix® platfonn. 

Table 5G shows the identity of those genes that are differentially expressed in blood 
samples &om patients with NASH as compared with patients not having NASH using the 
Affymetrix® platfonn. 

10 Table 5H shows the identity of those genes that are difTerentially expressed in blood 
samples from patients with alzheimers' disease as compared with patients not having 
alzheimer's disease using the Affymetrix® platform. 

Table 51 shows the identity of those genes that are differentially expressed in blood 
samples from patients with Manic Depression Syndrome as compared with patients not 
15 having Manic Depression Syndrome using the Affymetrix® platform. 

Table 5J shows the identity of those genes that are differentially expressed in blood 
samples from patients with Crohn's Colitis as compared with patients not having Crohn's 
Colitis using the Affymetrix® platform. 

Table 5K shows the identity of tiiose genes that are differentially expressed in blood 
20 samples from patients with Chronis Cholecystits as compared with patients not having 
Chronis Cholecystits using the Affymetrix® platform. 

Table 5L shows the identity of those genes that are differentially expressed in blood 
samples from patients with Heart Failure as compared with patients not having Heart 
Failure using the Affymetrix® platform. 

25 Table 5M shows the identity of those genes that are differentially expressed in blood 

samples from patients with Cervical Cancer as compared with patients not having Cervical 
Cancer using the Affymetrix® platform. 
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Table 5N shows the identity of those genes that are differentially expressed in blood 
samples from patients with Stomach Cancer as compared with patients not having Stomach 
Cancer using the Affymetrix® platform. 

Table 50 shows the identity of those genes that are differentially expressed in blood 
samples from patients with Kidney Cancer as compared with patients not having Kidney 
Cancer using the Afiymetrix® platform. 

Table 5P shows the identity of those genes that are differentially expressed in blood 
samples from patients with Testicular Cancer as compared with patients not having 
Testicular Cancer using the Affymetrix® platform. 

Table 5Q shows the identity of those genes that are differentially expressed in blood 
samples from patients with Colon Cancer as compared with patients not having Colon 
Cancer using the Affymetrix® platform. 

Table 5R shows the identity of those genes that are differentially expressed in blood 
samples from patients with Hepatitis B as compared ^yitil patients not having Hepatitis B 
using the Affymetrix® platform. 

Table 5S shows the identity of those genes that are differentially expressed in blood 
samples from patients with Pancreatic Cancer as compared with patients not having 
Pancreatic Cancer using the Affymetrix® platform. 

Table 5T shows the identity of tiiose genes that are differentially expressed in blood 
samples from patients with Asymptomatic Chagas as compared with patients not having 
Chagas using the Affymetrix® platforai. 

Table SU shows the identity of those genes that are differentially expressed in blood 
samples from patients with Symptomatic Chagas as compared with patients not havmg 
Chagas using the Affymetrix® platform. 

Table 5V shows the identity of those genes tiiat are differentially expressed in blood 
samples from patients with Bladder Cancer as compared with patients not having Bladder 
Cancer using the Affymetrix® platform. 
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Table 6 shows those genes that are differentially expressed in blood samples from patients 
with any one of a series of related conditions as compared to blood samples from patients 
without said related conditions. 

Table 6A shows the identity of those genes that are differentially expressed in blood 
5 samples from patients with Cancer as compared witti patients not having Cancer using the 
Afiymetrix® platform. 

Table 6B shows the identity of those gemes fliat are differentially expressed in blood 
samples from patients with Cardiovascular Disease as compared with patients not having a 
Cardiovascular Disease using the Affymetrix® platform. 

10 Table 6C shows the identity of those genes that are differentially expressed in blood 

samples from patients with a Neurological Disease as compared witihi patients not having a 
Neurological Disease using the AflEymetrix® platform. 

Table 7 shows those genes that are differentially expressed in blood samples from with a 
condition wherein said condition is a treatment as conq)ared to blood samples from patients 
IS without said condition. 

Table 7A shows the identity of those genes that are differentiaUy expressed in blood 
samples from patients taking Celebrex® as compared with patients on a Cox hihibitor 
which was not Celebrex® using the ChondroChip™ platfomi. 

Table 7B shows the identity of those genes that are differentially expressed in blood 
20 samples from patients taking Celebrex® as compared with patients not on Celebwac® using 
the ChondroChip™ platform. 

Table 7C shows the identity of those genes that are differentially expressed in blood 
samples fcom patients taking Vioxx® as compared with patients not on Vioxx® using the 
ChondroChip™ platform. 

25 Table 7D shows the identity of tihiose genes that are differentially expressed in blood 

samples from patients taking Vioxx® as compared with patients on a Cox inhibitor but not 
on Vioxx® using the ChondroChip™ platform. 
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Table 7E shows the identity of those genes that are differentially expressed in blood 
samples from patients taking NSAIDS as compared with patients not on NSAIDS using the 
ChondroChip™ platform. 

Table 7F shows the identity of those genes that are differentially expressed in blood 
5 samples from patients taking Cortisone as compared with patients not on Cortisone using 
the ChondroChip™ platform. 

Table 7G shows the identity of those genes that are differentially expressed in blood 
samples from patients taking Visco Supplement as compared with patients not on Visco 
Supplement using the ChondroChip™ platform. 

10 Table 7H shows the identity of those genes that are differentially expressed in blood 

samples from patients taking Lipitor® as compared with patients not on Lipitoi® using the 
ChondroChip™ platform- 
Table 71 shows the identity of those genes that are differentially expressed in blood 
samples from patients who are smokers as compared with patients who are not smokers 

1 5 using the ChondroChip™ platform 

Table 8A is an annotation table showing the relationship between the gene ID identified in 
Tables 1-7 wherein the data was generated using the Affymetrix® platform and gene 
identified by the AflFymetrix probe. 

Table 8B is an annotation table showing the relationship between the clone ID identified in 
20 Tables 1-7 wherein the data was generated using the ChondroChip™ platform and the gene 
identified by the EST clones. 

Table 9 shows the descriptions as to the various annotations provided for both the 
ChondroChip™ and the Affymetrix® microarray results. 

Table 10 shows how the incidence of different stages of OA varies with respect to age in 
25 males and females 

Table 11 shows 223 EST sequences of Tables 1 A- 71 with "no-significant match" to known 
gene sequence in Patent-In Format. 
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Table 12 shows a list of genes showing greater than two fold differential expression in 
CAD peripheral blood cells relative to that of normal blood cells. 



The following examples are given for the purpose of illustrating various 
5 embodiments of the invention and are not meant to hmit the present invention in any 
fashion. 



EXAMPLE 1 

10 Blood cDNA chip Microarray Data Analysis of RNA expression profiles of blood 

samples fiom individuals having coronary artery disease as compared with RNA 
expression profiles &om normal individuals. 

A microarray was constructed using cDNA clones fiiom a human peripheral blood 
cell cDNA library, as described herein. A total of 10,368 polymerase chain reaction (PGR) 
15 products of the clones from the human peripheral blood cell cDNA library described herein 
were arrayed using GNS 417 airayer (Affymetrix). RNA for microarray analysis was 
isolated fiom whole blood samples without prior firactionation, obtained Scorn three male 
and one female patients wifli coronary heart disease (80 - 90% stenosis) receiving vascular 
extension drugs and awaiting bypass surgery, and three healthy male controls. 

20 A method of high-fidelity mRNA amplification from Ipg of total RNA sample was 

used. Cy5- or Cy3-dUTP was incorporated into cDNA probes by reverse transcription of 
anti-sense RNA, primed by oligo-dT. Labeled probes were purified and concentrated to the 
desired volume. Pre- hybridization and hybridization were performed following Hegde's 
protocol (Hegde P et al., A concise guide to cDNA microarray analysis. Biotechniques 

25 2000;29: 548 - 56). After overnight hybridization and washing, hybridization signals were 
detected with a GMS 418 scanner at 635-nm (Cy5) and 532-nm (Cy3) wave lengflis (see 
Figure 17). Two RNA pools were labeled alternatively with Cy5- and Cy3-dUTP, and each 
experiment was repeated twice. Cluster analysis using GeneSpring™ 4.1 .5 (Silicon 
Genetics) revealed two distinct groups consisting of four CAD and three normal control 



65 



wo 2004/112589 PCT/US2004/020836 

samples. Two images scanned at different wavelengths were super- imposed. Individual 
spots were identified on a customized grid. Of 10,368 spots, 10,012 (96.6%) were selected 
after the removal of spots with irregular shapes. Data quaUty was assessed with values of 
ChlGTB2 and Ch2GTB2 provided by ScanAlyze. Only spots with ChlGTB2 and 

5 Ch2GTB2 over 0.50 were selected. After evaluation of signal intensities, 8750 (84.4%) 
spots were left. Sigaal intensities were normalized using a scatter-plot of the signal 
intensities of the two channels. After normalization, the expression ratios of B-actin were 
1,00 + 021 1 11+0.22, 1.14 + 0,20 and 1.30 + 0.18(24 samples offi-actin were spotted on 
this slide as the positive control) in the four images. Differential expression of RNA was 

10 assessed as the ratio of two wave-length signal intensities. Spots showing a differential 
expression more than twofold relative to normal in all four experiments were identified as 
peripheral blood cell, differentially expressed candidate genes m CAD. 108 genes are 
differentially expressed in CAD peripheral blood cells. 43 genes are downregulated in 
CAD blood cells and 65 are upregulated (see Table 12). Functional characterization of 

15 these genes firom which the differentially expressed RNA transcripts were transcribed 
shows that differential expression at the level of RNA transcription takes place in every 
gene functional category, indicating that profound changes occur in peripheral blood cells 
fi-om patients with CAD. 

The differential expression of RNA transcribed firom three genes, pro-platelet basic 
20 protein (PBP), platelet factor 4 (PF4) and coagulation factor Xm Al (F13A), initially 
identified in the microarray data analysis, was fiirther examined by reverse transcriptase- 
PCR (RT-PCR) using ihe Titan One-tube RT-PCR kit (Boehringer Mannheim). Reaction 
solution contains 0.2 mM each dNTP, 5 mM DTT, 1.5 mM MgCl 0.1 pg of total RNA 
firom each sample and 20 pmol each of left and right primers of PBP (5'- 
25 GGTGCTGCTGCTTCTGTCAT-3' (SEQ ID NO: 224) and 5'-GGCAGATTTT 

CCTCCCATCC-3'), (SEQ ID NO:225), F13A (5^-AGTCCACCGTGCTAACCATC-3' 
(SEQ ID NO:226), and 5'"AGGGAGTCACTGCTCATGCT-30 (SEQ ID NO:227), and 
PF4 (5' GTTGCTGCTCCTGCCACTT 3' (SEQ ID NO:228), and 5* 
GTGGCTATCAGTTGGGCAGT-.3')(SEQ ID NO:229), RT-PCR steps are as follows: 1. 
JO reverse-transcription: 30 min at 60 ^C; 2. PGR: 2 min at 94 °C, followed by 30 - 35 cycles 
(as optimized for each gene) for 30 s at 94 °C, 30 s at optimized annealing temperature and 
2 min at 68 °C; 3. final extension: 7 min at 68 °C. PGR products were electrophoresed on 
1.5% agarose gels. Human (DB -actin primers (5'.GCGAGAAGATGACCCAGATCAT-3* 
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(SEQ ID NO:230) and 5'-GCTCAGGAGGAGCAATGATCTT-3 (SEQ ID NO:231) were 
used as the internal control. The RT-PCR analysis confirmed that the expression of the 
three secreted proteins: PBP, PF4 and F13A were all upregulated in CAD blood cells (see 
Figures 27 and 17) 



TABLE 12 



Upregulated gene in CAD 
REV3-like, catalytic subunit 
of DNA polymerase zeta 
TGFBl-induced anti- 
apoptotic factor 1 
A disintegrin and 
metalloproteinase domain 10 
Centaurin, delta 2 
Chloride intracellular 
channel 4 

Endothelin receptor typeA 
Glutamate receptor, 
ionotropic 

Mitogen-activated protein 
kinase? 

Mitogen-activated protein 
kinase kinase kinase 7 
Myristoylated alanine-rich 
protein kinase C substrate 
NIMA-related kinase 7 
PAK2 

Phospholipid scramblase 1 
Serum dqprivation response 



Fold 

Accession Functional 
(averag 



number 



D10522 

AA093324 
AA262968 
AA054476 
Z30112 



e) 



category 



AF035537 2.3 CeU cycle 



D86970 2.2 Cell cycle 



Protein 
Accession 
Number 

]SIP_002903 

NP 510880 



AA044656 2.7 CeU signaling 1SIP_001101 

AA351412 2 CeU signaling NP_631920 

AA411940 2.2 CeU signaling NP_039234 

D90348 2.1 CeUsignaUng 1SIP_001948 

N33821 2.4 CeU signaling ]SIP_777567 

L38486 3.7 CeUsignaUng NP_002395 

AB009356 . 4.5 CeUsignaUng NP_663306 



2.5 CeU signaling NP_002347 



3.5 
3.5 
3.3 
4.5 



CeU signaling 
CeU signaling 
CeU signaUng 
CeU signaling 



NP_598001 

Q13177 
NP_066928 
NP 004648 
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Adciucm 3 


A An901 ^8 


2 Q 


Cell structure 


NP_063968 


Desmin 


AT lO ly 




Cell structure 


NP_001918 


Fibromodulin 


yVZjOI J 




Cell structure 


NP_002014 


Laminuiy beta 2 


Q77<1 7 


9 9 


Cell structure 


NP_002283 


Laminin, beta 3 


T 7^^A1 


9 4 


Cell structure 


NP^000219 


Osteonectin 


Y00755 


3.1 


Cell structure 


NP_003109 


CD59 antigen plo-2U 


wni 111 
WUllll 


9 ^ 


Cell/organism 


NP_000602 








defense 




Clusterin 


M04/Z2 




Cell/organism 


NP_001822 








defense 




F13A 




9 1 


Cell/organism 


NP_000120 








defense 




Defensin, alpha 1 


M2qoU/ 


A 9 


Cell/organism 


NP_004075 








defense 




PF4 


jvizDoy / 


9 1 


Cell/organism 


NP_002610 








defense 




PBr 




< < 


Cell/organism 


NP_002695 








defense 




lizr transcription lacior j 




2 1 


Gene expression 


NP_001940 


Early growth response 1 




9 7 


Gene expression 


NP_001955 


Eukaryotic translation 






Gene expression 


NP_001393 


elongation factor 1 alpha 1 










jj)UKaryonc uransxaiion 




2 1 


Gene expression 


NP_0O1959 


initiation lacior 










F-box and WD-40 domain 


AB014596 


2.7 


Gene expression 


NP_387449 


protein IB 










JviaKonn, Hug ixugcr piuicui, 
o 


AA331966 


2.1 


Gene expression 


NP_054879 


JNon-canonicai uDiquiiin- 




2 5 


Gene expression 


NP_057420 


conjugating enzyme i 










Nuclear receptor subfamily 


Z3U4z!> 


/I 7 

4./ 


Gene expression 


NP_005113 


1, group I, member 3 










Ring finger protein 1 1 


T08927 


3 


Gene expression 


NP_055187 
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Transducin-like enhancer of M99435 
split 1 

Alkaline phosphatase, ABOl 1406 
liver/bone/kidney 

AnnexinAS M63310 

Branched chain AA336265 
aminotransferase 1, cytosolic 

Cytochrome b AF042500 

Glutaminase D30931 

Lysophospholipase I AF035293 

NADH dehydrogenase 1, AA0S61 1 1 
subcomplex unknown 1, 6 
kDa 

Pho^hofiuctokinase M26066 

Ubiquinol-cytochrome c M22348 
reductase binding protein 

CGI-110 protein AA341061 

Dactylidin H95397 

Deleted in split-hand/spUt- T24503 
foot 1 region 

FolUstatui-like 1 R14219 

FUS-interacting protein 1 W37945 

Hypothetical protein W47233 
FU12619 

Hypothetical protein from N68247 

EUROIMAGE 588495 

Hypothetical protein AA25 1423 

LOC51315 

KIAA1705 protdn T80569 

Mesodenn induction early AI650409 
response 1 

Phosphodiesterase 4D- AA740661 
interacting protein 
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3.3 Gene expression NP_005068 
2.2 Metabolism NP_000469 

3.4 Metabolism NP_005130 
4.8 Metabolism MP 005495.1 



2.5 
2.6 
2.8 
2.5 



2.2 
2.5 

2.4 
2.7 
2.4 



Metabolism 
Metabolism 
Metabohsm 
Metabolism 



Metabolism 
Metabohsm 

Unclassified 
Unclassified 
Unclassified 



1SIP_055720 
NP_006321 
MP 002485 



NP_000280 
]SIP_006285 

NP_057131 
NP_112225 
NP 006295 



2.7 Unclassified NP_009016 

2.8 Unclassified NP_473357 
7 Unclassified NP_1 12201 

2.7 Unclassified 

2.2' Unclassified NP_057702 

2.7 Unclassified NP_009121.1 

2.2 Unclassified NP_065999 

2.5 Unclassified NP 055459 
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Preimplantation protein 3 
Putative nuclear protein 
ORF1-FL49 
Similar to rat nuclear 
ubiquitous casein kinase 2 
Similar to RIKEN 
Spectrin, beta 
Stromal cell-derived factor 
receptor 1 

Thioredoxin-related protein 
Transmembrane 4 
stq>erfan)ily member 2 
Tumor endothelial marker 8 
Downregulated gene in CAD 
CASP8andFADD-Uke 
apoptosis regulator 
CD81 antigen 
Cell division cycle 25B 
DEAD/H (Asp-Glu-Ala- 
Asp/His) box polypeptide 27 
F-box and leucine-rich repeat 
protein 11 
Minichiomosome 
maintenance deficient 3 
associated protein 
Protein phosphatase 2, 
regulatory subunit A, alpha 
isoform 

Thyroid autoantigen 70 kDa 
A disintegrin and 
metalloproteinase domain 17 
A kinase anchor protein 13 
Calpastatin 
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D59087 2.5 Unclassified 1SIP_056202 
W33098 2.8 Unclassified NP_1 15788 



H09434 



2.2 Unclassified Q9H1E3 



AA297412 
AI334431 
H71558 



2.5 
2.5 
4.1 



Unclassified 
Unclassified 



T02670 
Q01082 



Unclassified NP 816929 



AA421549 2.8 Unclassified NP_1 10437 

D29808 2.4 Unclassified NP_004606 

D79964 2.5 Unclassified NP_444262 

AF015450 0.45 Cell cycle NP_003870 



M33680 
M81934 



0.41 
0.4 



AA985699 0.42 
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EXAMPLE 2 

This example demonstrates the use of the claimed invention to identify biomarkers of 
hyperlipidimea and use of same. As used herein, a "biomarker" is any nucleic acid based 
substance that corresponds to, and can specifically identify a RNA transcript. 

As used herem, ^Tiyperlipidemia" is defined as an elevation of lipid protein profiles 
and includes the elevation of chylomicrons, very low-density lipoproteins (VLDL), 
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intermediate-density lipoproteins (DDL), low-density lipoproteins (LDL), and/or high-density 
lipoproteins (HDL) as compared with the general population, Hyperlipidemia includes 
hypercholesterolemia and/or hypertriglyceridemia. By hypercholesterolemia, it is meant 
elevated fasting plasma total cholesterol level ofi>200mg/dL, and/or LDL-cholesterol levels 
5 of>130mg/dL. A desirable level of HDL-cholesterol is> 6Qmg/dL. By hypertriglyceridemia 
it is meant plasma triglyceride (TG) concentrations of greater than the 90* or 95* percentile 
for age and sex and can include, for example, TG > 16Qmg/dL as determined after an 
overnight fast. 

The level of one or more RNA transcripts expressed in blood obtained from one or 
10 more individuals with hyperlipidemia was determined as follows. Whole blood samples were 
taken from patients who were diagnosed with hyperlipidemia as defined herein, hi each 
case, the diagnosis of hyperlipidemia was corroborated by a skilled Board certified physician. 
Total mKNA from lysed blood was isolated using TRIzol® reagent (GIBCO). 
Fluorescently labeled probes for each blood sample were generated as described above. Each 
15 probe was denatured and hybridized to a ISK Chondrogene Microarray Chip 

(ChondroChip™) and/or an Aflfymetrix GeneChip® microarray as described herein. The 
presence of a fluorescent dye on the microarray indicates hybridization of a target nucleic 
acid and a specific nucleic acid member on the microarray. The intensities of fluorescence 
dye represent the amount of target nucleic acid which is hybridized to the nucleic acid 
20 member on the microarray, and is indicative of the expression level of the specific nucleic 
acid member sequence in the target sample. 

Those transcripts which display differing levels with respect to the levels of those 
from patients unaffected by hyperlipidemia were identified as being biomarkers for said 
disease of interest. Identification of genes differentially expressed in whole blood samples 
25 from patients with hyperlipidemia as compared to healthy patients was determined by 
statistical analysis using the Wilcox Mann Whitney rank sum test. 

Classification or class prediction of a test sample as either having hypertension and 
OA or being normal can be done using the differentially expressed genes as shown in Table 
1 A in combination with well known statistical algorithms for class prediction as would be 
30 understood by a person skilled in the art and described herein. Commercially available 
programs such as those provided by Silicon Genetics (e.g. GeneSpring'''^ for Class 
Predication are also available. 
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Figure 13 shows a diagranunatic rq)resentation of RNA expression prolBles of whole 
blood samples from individuals who were identified as having hyperlipidemia as described 
herein as compared with RNA expression profiles from normal and non-hyperlipidemia 
patients. Expression profiles were generated using GeneSpring™ software analysis as 
5 described herem. Each column represents the hybridization pattern resulting from a single 
individual. Normal individuals have no known medical conditions and were not taking any 
known medication. Non hyperlipidemia individuals presented without elevated cholesterol or 
elevated triglycerides but may have presented with other medical conditions and may be 
under various treatment regimes. 

10 A dendogriam analysis is shown above. Samples are clustered and marked as 

representing patients who have elevated lipids and/or cholesterol, are normal or do not have 
elevated lipids or cholesterol. The indicates those patients who abnormally clustered as 
having either hyperlipidemia, normal or non-hyperlipidemia despite actual presentation. The 
number of hybridizations profiles determined for hyperlipidemia patients, non- 
15 hyperhpidemia patients and normal individuals are shown. Various experiments were 

performed as outlined above, and analyzed using either the Wilcox Mann Whitney rank sum 
test, or other statistical analysis as described herein and those genes identified with a p value 
of < 0.05 as between the patients with hyperlipidemia as compared with patients without 
hyperhpidemia are shown in Table 

20 Classification or class prediction of a test sample from an unknown patient in order to 

diagnose said individual with hyperlipidemia can be done using the differentially expressed 
genes as shown in Table ID in combination with well known statistical algorithms for class 
prediction as would be understood by a person skilled in the art and is described herein. 
Commercially available programs such as those provided by Silicon Genetics (e.g. 

25 GeneSpring™) for Class Predication are also available. 

In addition to Hyperlipidemia, biomarkers for the following diseases were identified 
using the above method steps to identify one or more genetic markers for the following 
diseases; Type 11 Diabetes, Hypertension, Obesity, Lung Disease, Bladder Cancer, Coronary 
30 Artery Disease, Rheumatoid Arthritis, Depression, Osteoarthritis, Liver Cancer, 

Schizophrenia, Chagas Disease, Asthma, Lung Cancer, Heart Failure, Psoriasis, Thyroid 
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Disorder, Irritable Bowel Syndrome, Osteoporosis, Migraine Headaches, Eczema, NASH, 
Alzheimer's Disease, Manic Depression Syndrom, Crohn's Colitis, Chronic Cholecystits, 
Cervical Cancer, Stomach Cancer, Kidney Cancer, Testicular Cancer, Colon Cancer, 
Hepatitis B, and Pancreatic Cancer. 

5 Diabetes 

This example demonstrates the use of the claimed invention to identify biomarkers of 
diabetes and use of same. 

As used herein, "diabetes", or "diabetes mellitus" mcludes both "type 1 diabetes" 

10 (insulin-dependent diabetes (IDDM)) and "type 2 diabetes" (insulin-independent diabetes 
(NIDDM). Both type 1 and type 2 diabetes characterized in accordance with Harrison's 
Principles of Internal Medicine 14th edition, as a person having a venous plasma glucose 
concentration > 140mg/dL on at least two separate occasions after overnight fasting and 
venous plasma glucose concentration > 200mg/dL at 2 h and on at least one other occasion 

15 during the 2-'h test following ingestion of 75g of glucose. Patients identified as having type 2. 
diabetes as described herein are those demonstrating insulin-independent diabetes as 
determined by the methods described above. Whole blood samples were taken from patients 
who were diagnosed with type 2 diabetes as defined herein. In each case, the diagnosis of 
type 2 diabetes was corroborated by a skilled Board certified physician. Figure 12 shows a 

20 diagrammatic representation of RNA expression profilesRNA expression profiles of Whole 
blood samples from individuals who were identified as having type 2 diabetes as described 
herein as compared RNA expression profilesRNA expression profiles from individuals not 
having type 2 diabetes. RNA expression profilesRNA expression profiles were generated 
using GeneSpring™ software analysis as described herein. Hybridizations to create said 

25 RNA expression profilesRNA expression profiles were done using the 15K Chondrogene 
Microairay Chip (ChondroChip™) as described herein. Samples are clustered and marked 
as representing patients who have type 2 diabetes or control individuals. The number of 
hybridizations profiles detennined for patients with type 2 diabetes or who are controls are 
shown. Various experiments were performed as outlined above, and analyzed using either 

30 the Wilcox Mann Whitney rank sum test, or other statistical tests as described herein, and 
those genes identified with a p value of < 0.05 as between the patients with type 2 diabetes as 
compared with patients without type 2 diabetes are shown in Table IP. 
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Classification or class prediction of a test sample firom an unknown patient in order to 
diagnose said individual with type 2 diabetes can be done using the differentially expressed 
genes as shown in Table IP in combination with well known statistical algorithms for class 
prediction as would be understood by a person skilled in the art and is described herein. 
5 Conunercially available programs such as those provided by Silicon Gtenetics (e.g. 
GeneSpring™) for Class Predication are also available. 

RNA expression proiilesRNA expression profilesLung Disease 

This example demonstrates the use of the claimed invention to identify 
10 biomarkers of Lung Disease and use of same. 

As used herein, "limg disease" encompasses any disease that affects the respiratory 
system and includes bronchitis, chronic obstructive lung disease, emphysema, asthma, and 
lung cancer. Patients identified as having lung disease includes patients having one or more 
of the above noted conditions. In each case, the diagnosis of lung disease was corroborated 

15 by a skilled Board certified physician. Figure 14 shows a diagrammatic representation of 
RNA expression profilesRNA expression profiles of Whole blood samples firom individuals 
who were identified as having lung disease as described herein as compared with RNA 
expression profilesRNA expression profiles firom normal and non lung disease individuals. 
Samples are clustered and marked as representing patients who have Ixmg disease, are normal 

20 or do not have lung disease. The indicates those patients who abnormally clustered 
despite actual presentation. The number of hybridizations profiles determined for either the 
lung disease patients, non-lung disease patients and normal mdividuals are show. Various 
experiments were performed as outlined above, and analyzed using the Wilcox^Maim 
Whitney rank sum test, or other statistical analysis as described herein and those genes 

25 identified with a p value of < 0.05 as between the patients with lung disease as compared with 
patients without lung disease are shown in Table IR. 

Classification or class prediction of a test sample fi-om an unknown patient in order to 
diagnose said individual with lung disease can be done using the differentially expressed 
genes as shown in Table IR in combination with well known statistical algorithms for class 
30 prediction as would be understood by a person skilled in the art and is described herein. 
Commercially available programs such as those provided by Silicon Genetics (e.g. 
GeneSpring™) for Class Predication are also available. 
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Bladder Cancer 

This example demonstrates the use of the claimed invention to identify 
biomarkers of bladder cancer and use of same. 

5 As used herein, "bladder cancer" includes carcinomas that occur in the transitional 

q7ithelium lining the urinary tract, startmg at the renal pelvis and extending through the 
\ireter, the urinary bladder, and the proximal two-thirds of the urethra. As used herein, 
patients diagnosed with bladder cancer include patients diagnosed utilizing any of the 
following methods or a combination thereof: urinary cytologic evaluation, endoscopic 

.0 evaluation for the presence of malignant cells, CT (computed tomography), MRJ (magnetic 
resonance imaging) for metastasis status. In each case, the diagnosis of bladder cancer was 
corroborated by a skilled Board certified physician. Figure 15 shows a diagrammatic 
representation of RNA expression profilesRNA expression profiles of Whole blood samples 
fi-om individuals who were identified as having bladder cancer as described herein as 

15 compared with KNA expression profilesRNA expression profiles fi-om non bladder cancer 
individuals. Expression profiles were generated using GeneSpring'^'^ software analysis as 
described herein. Each colunon represents the hybridization pattern resulting from a single 
individual. Non bladder cancer individuals presented without bladder cancer, but may have 
presented with oth^ medical conditions and may be under various treatment regimes. 

20 Hybridizations to create said RNA expression profilesRNA expression profiles were done 
usingthe AflfymetrixU133Achip. A dendogram analysis is shown above. Samples are 
clustered and marked as rq)resenting patients who have bladder cancer, or do not have 
bladder cancer. The indicates those patients who abnormally clustered as either bladder 
cancer, or non bladder cancgr deqiite actual presentation. The number of hybridizations 

25 profiles determined for patients with bladder cancer and without bladder cancer to create said 
Figure are shown. Various experiments were performed as outlined above, and analyzed 
using either the Wilcox Maim Whitney rank sum test, or other statistical tests as described 
herein and those genes identified with a p value of < 0.05 as between the patients with 
bladder cancer as compared with patients without bladder cancer are shown in Tables IS. 

30 Classification or class prediction of a test sample from an unknown patient in order to 

diagnose said individual with bladder cancer can be done using the differentially expressed 
genes as shown in Table IS in combination with well known statistical algorithms for class 
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prediction as would be understood by a person skilled in the art and is described herein. 
Commercially available programs such as those provided by Silicon Genetics (e.g. 
GeneSpring™) for Class Predication are also available. 

5 Coronary Artery Disease 

This example demonstrates the use of the claimed invention to identify 
biomarkers of coronary artery disease and use of same. 

As used herein, "Coronary artery disease" (CAD) is defined as a condition wherein at 
least one coronary artery has >50% luminal diameter stenosis, as diagnosed by coronary 
10 angiography and includes conditions in which there is atheromatous narrowing and 
subsequent occlusion of the vessel. CAD includes those conditions which manifest as 
angina, silent ischaemia, unstable angina, myocardial infarction, arrhythmias, heart failure, 
and sudden death. Patients identified as having CAD includes patients having one or more 
of the above noted conditions. In each case, the diagnosis of Coronary artery disease was 
15 corroborated by a skilled Board certified physician. Figure 17 shows a diagrammatic 

representation of RNA expression profilesRNA expression profiles of Whole blood samples 
fix)m individuals who were identified as having coronary artery disease (CAD) as described 
herein as compared with RNA expression profilesRNA expression profiles firom non- 
coronary artery disease individuals. Expression profiles were generated using GeneSpring™ 
20 software analysis as described herein. Each column represents the hybridization pattern 

resulting fi-om a single individual. Non coronary artery disease individuals presented without 
coronary artery disease, but may have presented with other medical conditions and may be 
under various treatment regimes. Hybridizations to create said RNA expression profilesRNA 
expression profiles were done using the Affymetrix U133A chip. A dendogram analysis is 
25 shown above. Samples are clustered and marked as representing patients who have coronary 
artery disease or do not have coronary artery disease. The indicates those patients who 
abnormally clustered despite actual presentation. The number of hybridizations profiles 
determined for patients with CAD or without CAD are shown. Various experiments were 
performed as outUned above, and analyzed using either the Wilcox Mann Whitney rank sum 
30 test, or other statistical analysis as described herein and those genes identified with a p value 
of < 0.05 as between the patients with coronary artery disease as compared with patients 
without coronary artery disease are shown in Table lU. 
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Classification or class prediction of a test sample from an unknown patient in order to 
diagnose said individual with CAD can be done using the differentially expressed genes as 
shown in Table lU in combination with well known statistical algorithms for class prediction 
as would be understood by a person skilled in the art and is described herein. Commercially 
5 available programs such as those provided by Silicon Genetics (e.g. GeneSpring™) for Class 
Predication are also available. 

Rheumatoid Arthritis 

This example demonstrates the use of the claimed invention to identify 
10 biomarkers of rheumatoid arthritis and use of same. 

As used herein "Rheumatoid Arthritis" (RA) is defined as a chronic, multisystem 
disease of unknown etiology with the characteristic feature of persistent inflammatory 
synovitis. Said inflammatory synovitis usually involves peripheral joints in a systemic 
distribution. Patients having RA as defined herein were identified as having one or more of 

15 the following; (i)cartilage destruction, (ii)bone erosions and/or (iii) joint deformities. Whole 
blood samples were taken from patients who were diagnosed Rheumatoid arthritis as defined 
herein. In each case, the diagnosis of Rheumatoid arthritis was corroborated by a skilled 
Board certified physician. Figure 18 shows a diagrammatic representation of RNA expression 
profilesRNA expression profiles of Whole blood samples &om individuals who were 

20 identified as havmg rheumatoid arthritis as described herein as compared with RNA 
expression profilesRNA expression profiles fix)m non-rheumatoid arthritis individuals. 
Expression profiles were generated using GeneSpring™ software analysis as described 
herein. Each colunrn represents the hybridization pattern resulting fit>m a single individual. 
Normal individuals have no known medical conditions and were not taking any known 

25 medication. Non rheumatoid arthritis individuals presented without rheumatoid arthritis, but 
may have presented with other medical conditions and may be under various treatment 
regimes. Hybridizations to create said RNA expression profilesRNA expression profiles 
were done usmg ChondroChip™ and Affymetrix U133A Chip. A dendogram analysis using 
the ChondroChip is shown above. Samples are clustered and marked as representing patients 

30 who have rheumatoid arthritis or do not have rheumatoid arthritis. The indicates those 
patients who abnormally clustered despite actual presentation. The number of hybridizations 
profiles determined for patients with rheumatoid arthritis and without rheumatoid arthritis are 
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shown. Various experiments were performed as outlined above and analyzed using either the 
Wilcox Mann Whitney rank sum test, or other statistical tests as described herein and those 
genes identified with a p value of < 0.05 as between the patients with rheumatoid arthritis as 
compared with patients without rheumatoid arthritis are shown. Data generated using the 
5 ChondroChip™ array is shown in Table IV whereas data generated using the Afiymetrix 
U133A Chip is shown in Table IW. 

Classification or class prediction of a test sample firom an unknown patient in order to 
diagnose said individual with rheumatoide arttiritis can be done using the differentially 
expressed genes as shown in Table IV and 1 W in combination with well known statistical 
10 algorithms for class prediction as would be understood by a person skilled in the art and is 
described herein. Commercially available programs such as those provided by SiUcon 
Genetics (e.g. GeneSpring'^^ for Class Predication are also available. 



15 Depression 

This example demonstrates the use of the claimed invention to identify biomarkers of 
depression and use of same. 

As used herein "depression" includes depressive disorders or depression m 
association with medical illness or substance abuse in addition to depression as a result of 

20 sociological situations. Patients defined as having depression were diagnosed mainly on the 
basis of clinical symptoms including a depressed mood episode wherein a person displays a 
depressed mood on a daily basis for a period of greater than 2 weeks. A depressed mood 
episode may be characterized by sadness, indifference, apathy, or irritability and is usually 
associated with changes in a number of neurovegetative functions, including sleep patterns, 

25 ^petite and weight, fatigue, impairment in concentration and decision making. Whole blood 
samples were taken from patients who were diagnosed with depression as defined herein. In 
each case, the diagnosis of depression was corroborated by a skilled Board certified 
physician. Figure 19 shows a diagrammatic representation of RNA expression profilesRNA 
expression profiles of Whole blood samples from individuals who were identified as having 

30 depression as described herein as compared with RNA expression profilesRNA expression 
profiles from non-depression individuals. Expression profiles were generated using 
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GeneSpring™ software analysis as described herein. Each column represents the 
hybridization pattern resulting from a single individual. Normal individuals have no known 
medical conditions and were not taking any known medication. Non depression individuals 
presented without depression, but may have presented with other medical conditions and may 

5 be under various treatment regfanes. Hybridizations to create said RNA expression 

profilesRNA expression profiles were done using ChondroChip™. A dendogram analysis is 
shown above. Samples are clustered and marked as representing patients who have 
depression, having non-depression or normal. The indicates those patients who 
abnormally clustered despite actual presentation. The number of hybridizations profiles 

10 determined for patients with depression, non-depression and normal are shown. Various 
experiments were performed as outlined above, and analyzed using either the Wilcox Mann 
Whitney rank sum test, or other statistical tests as described herein and those genes identified 
with a p value of < 0.05 as between the patients with depression as compared with patients 
without depression are shown in Table IX. 

1 5 Classification or class prediction of a test sample from an unknown patient in order to 

diagnose said individual with depression can be done using the differentially expressed genes 
as shown in Table IX in combination with well known statistical algorithms for class 
prediction as would be understood by a person skilled in the art and is described herein. 
Commercially available programs such as those provided by Silicon Genetics (e.g. 

20 GeneSpring™) for Class Predication are also available. 



Osteoarthritis 

This example demonstrates the use of the claimed invention to identify 
25 biomarkers which differentiate various stages of Osteoarthritis and use of same. 

"Osteoarthritis" (OA), as used herein also known as "degenerative joint disease", represents 
failure of a diarthrodial (movable, syno vial-lined) joint. It is a condition, which affects joint 
cartilage, and or subsequently underlying bone and supporting tissues leading to pain, 
stififiiess, movement problems and activity limitations. It most often affects the hip, knee, 
30 foot, and hand, but can affect other joints as well. OA severity can be graded according to the 
system described by Marshall (Marshall KW. J Rheumatol, 1996:23(4) 582-85). Briefly, 
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each of the six knee articular surfaces was assigned a cartilage grade with points based on the 
worst lesion seen on each particular surface. Grade 0 is normal (0 points). Grade I cartilage 
is soft or swollen but the articular surface is intact (1 point). In Grade II lesions, the cartilage 
surface is not intact but the lesion does not extend down to subchondral bone (2 points). 
Grade in damage extends to subchondral bone but the bone is neither eroded nor ebumated 
(3 points). In Grade IV lesions, th^e is ebumation of or erosion into bone (4 points). A 
global OA score is calculated by summing the points from all six cartilage surfaces. If there 
is any associated pathology, such as meniscus tear, an extra point will be added to the global 
score. Based on the total score, each patient is then categorized into one of four OA groups: 

3 mild (1-6), moderate (7-12), marked (13-18), and severe (>1 8). As used herein, patients 
identified with OA may be categorized in any of the four OA groupings as described above. 
Whole blood samples were taken from patients who were diagnosed with osteoarthritis and a 
specific stage of osteoarthritis as defined herein. In each case, the diagnosis of 
osteoarthritis and the stage of osteoarthritis was corroborated by a skilled Board certified 

5 physician . Figure 20 shows a diagrammatic representation of RNA expression profilesRNA 
expression profiles of Whole blood samples from individuals having various stages of 
osteoarthritis as compared with RNA expression profilesRNA expression profiles from 
normal individuals. Expression profiles were generated using GeneSpring™ software 
analysis as described herein. Each column represents the hybridization pattem resulting from 

to a single individual. Normal individuals have no known medical conditions and were not 
taking any known medication. Hybridizations to create said RNA expression profilesRNA 
expression profiles were done using the ChondroChip™. A dendogram analysis is shown 
above. Samples are clustered and marked as representing patients who presented with 
different stages of osteoarthritis or normal. The indicates those patients who abnormally 

25 clustered despite actual presentation. The number of hybridizations profiles determined for 
either osteoarthritis patients or normal individuals are shown in Figure 20. Statistical analysis 
was done using an ANOVA test and those genes identified with a p value of < 0.05 in 
pairwise comparisons between patients with mild, moderate, marked, severe or no 
osteoarthritis as shown in Table 1 Y. 

30 

Liver Cancer 
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This example demonstrates the use of the claimed invention to identify biomarkers of 
liver cancer and use of same. 

As used herein, "liver cancer" means primary liver cancer wherein the cancer initiates 
in the liver. Primary liver cancer includes both hepatomas or hepatocellular carcinomas 

5 (HCC) which start in the liver and chonalgiomas where cancers develop in the bile ducts of 
the liver. Whole blood samples were taken from patients who were diagnosed with liver 
cancer as defined herein. In each case, the diagnosis of liver cancer was corroborated by a 
skilled Board certified physician. Figure 21 shows a diagrammatic representation of RNA " 
expression profilesRNA expression profiles of Whole blood samples firom individuals who 

10 were identified as having liver cancer as described herein as compared with RNA expression 
profilesRNA expression profiles firom non-liver cancer disease individuals. Expression 
profiles were generated using GeneSpring™ software analysis as described herein. Each 
colimm represents the hybridization pattern resulting firom a single individual. Control 
samples presented without fiver cancer but may have presented with other medical conditions 

15 and may be under various treatment regimes. Hybridizations to create said RNA expression 
profilesRNA expression profiles were done using the Affymetrix® U133A chip. A 
dendogram analysis is shown above. Samples are clustered and marked as representing 
patients who have liver cancer or control. The number of hybridizations profiles determined 
for patients with liver cancer or who are controls are shown. Various experiments were 

20 performed as outlined above, and analyzed using either the Wilcox Mann Whitney rank sum 
test, or other statistical tests as described herein, and those genes identified with a p value of 
< 0.05 as between the patients with liver cancer as compared with patients without liver 
cancer are shown in Table IZ. 

Classification or class prediction of a test sample firom an unknown patient in order to 
25 diagnose said individual with liver cancer can be done using the differentially expressed 
genes as shown in Table IZ in combination with well known statistical algorithms for class 
prediction as would be understood by a person skilled in the art and is described herein. 
Commercially available programs such as those provided by Silicon Genetics (e.g. 
GeneSpring™) for Class Prediction are also available. 

30 

Schizophrenia 
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This example demonstrates the use of the claimed invention to identify biomarkers of 
diabetes and use of same. 

As used herein, "schizophrenia" is defined as a psychotic disorders characterized by 
distortions of reality and disturbances of thought and language and withdrawal from social 
contact. Patients diagnosed with "schizophrenia" can include patients having any of the 
following diagnosis: an acute schizophrenic episode, borderline schizophrenia, catatonia, 
catatonic schizophrenia, catatonic type schizophrenia, disorganized schizophrenia, 
disorganized type schizophrenia, hebephrenia, hebephrenic schizophrenia, latent 
schizophrenia, paranoic type schizophrenia, paranoid schizophrenia, paraphrenia, paraphrenic 
schizophrenia, psychosis, reactive schizophrenia or the like. Whole blood samples were taken 
from patients who were diagnosed with schizophrenia as defined herem. In each case, the 
diagnosis of schizophrenia was corroborated by a skilled Board certified physician. Figure 
22 shows a diagrammatic representation of RNA expression profilesRNA expression profiles 
of Whole blood samples from mdividuals who were identified as having schizophrenia as 
described herein as compared with RNA expression profilesRNA expression profiles from 
non schizophrenic individuals. Expression profiles were generated usmg GeneSpring™ 
software analysis as described herein. Each column represents the hybridization pattern 
resulting from a single individual. Control samples presented without schizophrenia but may 
have presented with other medical conditions and may be under various treatment regimes. 
Hybridizations to create said RNA expression profilesRNA expression profiles were done 
usingthe A£fymetrix®U133Achip. A dendogram analysis is shown above. Samples are 
clustered and marked as representing patients who have schizophrenia or control individuals. 
The number of hybridizations profiles determined for patients with schizophrenia or who are 
controls are shown. Various experiments were performed as outlined above, and analyzed 
using either the Wilcox Mann Whitney rank sum test, or other statistical tests as described 
herein, and those genes identified with a p value of < 0.05 as between the patients with 
schizophrenia as compared with patients without schizophrenia are shown in Table 1 AA. 

Classification or class prediction of a test sample from an unknown patient in order to 
diagnose said individual with schizophrenia can be done using the differentially expressed 
genes as shown in Table 1 AA in combination with well known statistical algorithms for class 
prediction as would be understood by a person skilled in the art and is described herein. 
Commercially available programs such as those provided by Silicon Genetics (e.g. 
GeneSpring™) for Class Prediction are also available. 
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Chagas disease 

This example demonstrates the use of the claimed invention to identify 
biomarkers of Chagas' disease and use of same. 

5 As used herein, "Chagas' disease" is defined as a condition wherein an individual is 

infected with the protozoan parasite Trypanosoma cruzi and includes both acute and chronic 
infection. Acute infection with T. cruzi can be diagnosed by detection of parasites by eitiier 
microscopic examination of fresh anticoagulated blood or the bufiy coat, giemsa-stained thin 
and thick blood smears and/or mouse inoculation and culturing of the blood of a potentially 

10 infected individual. Even in the absence of a positive result from the above, an accurate 
determination of infection can be made by xenodiagnosis wherein reduviid bugs are allowed 
to feed on the patient's blood and subsequently the bugs are examined for infection. Chronic 
infection can be determined by detection of antibodies specific to the T. cruzi antigens and/or 
immunoprecipitation and electrophoresis of the T. cruzi antigens. 

15 As used herein "Symptomatic Chagas disease" includes symptomatic acute chagas 

and symptomatic chronic chagas disease. Acute symptomatic chagas disease can be 
characterized by one or more of the following: area of erythema and swelling (a chagoma); 
local lymphadenopathy; generalized lymphadenopathy; mild hqpatosplenomegaly; unilateral 
painless edema of the palpebrae and periocular tissues; malaise; fever; anorexia and/or edema 

:o of the face and lower extremities. Symptomatic chronic Chagas' disease include one or more 
of the following symptoms: heart rhythm disturbances, cardiomyopathy, thromboembolism, 
electrocardiographic abnormalities including right bundle-branch blockage; atrioventricular 
block; premature ventricular contractions and tachy- and bradyarrhythmias; dysphagia; 
odynophagia, chest pain; regurgitation; weight loss, cachexia and pulmonary infections. 

5 As used herein "Asymptomatic Chagas disease" is meant to refer to individuals who 

are infected with T, cruzi but who do not show eitiier acute or chronic symptoms of the 
disease. 

Whole blood samples were taken from patients who were diagnosed symptomatic or 
asymptomatic Chagas disease as defined herein, hi each case, the diagnosis of Chagas 
} disease was corroborated by a qualified physician. Figure 23 shows a diagrammatic 

representation of RNA expression profilesRNA expression profiles of Whole blood samples 
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from individuals who were identified as having symptomatic Chagas disease; asymptomatic 
Chagas disease or who were control individuals as described herein as compared with RNA 
expression profilesRNA expression profiles from individuals not having Chagas Disease. 
Expression profiles were generated using GeneSpring™ software analysis as described 
herein. Each column represents the hybridization pattern resulting &om a single individual. 
Control samples presented without Chagas disease but may have presented with other 
medical conditions and may be under various treatment regimes. Hybridizations to create 
said RNA expression profilesRNA expression profiles were done using the Affymetrix® 
U133A chip. A dendogram analysis is shown above. Samples are clustered a^d marked as 
representing patients who have symptomatic chagas disease; asymptomatic chagas disease or 
control. The number of hybridizations profiles determined for patients with chagas disease; 
asymptomatic chagas disease or who are controls are shown. Various experiments were 
performed as outlined above, and analyzed using either the Wilcox Mann Whitney rank sum 
test, or other statistical tests as described herein. Those genes identified with a p value of < 
O.OS as between the patients with Chagas disease as compared with patients without Chagas 
disease are shown in Table 1 AB. Those genes identified with a p value of < 0.05 as between 
the patients with Asymptomatic Chagas disease as compared with patients without Chagas 
disease are shovm in Table 5T. Those genes identified with a p value of < 0.05 as between 
the patients with Symptomatic Chagas disease as compared with patients without Chagas 
disease are shown in Table 5U. 

Classification or class prediction of a test sample from an unknown patient in order to 
diagnose said individual with symptomatic chagas disease can be done using the 
differentially expressed genes as shown in Table 5U in combination with well known 
statistical algorithms for class prediction as would be understood by a person skilled in the art 
25 and is described herein. Commercially available programs such as those provided by Silicon 
Genetics (e.g. GeneSpring™) for Class Prediction are also available. Classification or class 
prediction of a test sample from an unknown patient in order to diagnose said individual with 
asymptomatic chagas disease can be done using the differentially expressed genes as shown 
in Table 5T. 

30 Asthma 

This example demonstrates the use of the claimed invention to idratify biomarkers of 
asthma disease and use of same. 
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As used herein, "asthma" indicates a chronic disease of the airways in the lungs 
characterized by constriction (the tightening of the muscles surrounding the airways) and 
inflammation (the swelling and irritation of the airways). Together constriction and 
inflammation cause narrowing of the airways, which results in symptoms such as wheezing, 
coughing, chest tigihtness, and shortness of breath. Whole blood samples were taken from 
patients who were diagnosed with asthma as defined herein. In each case, the diagnosis of 
asthma was corroborated by a skilled Board certified physician. Expression profiles were 
generated using GeneSpring™ software analysis as described herem. Each column 
represents the hybridization pattern resulting bom a single individual. Hybridizations to 
create said RNA expression profilesRNA expression profiles were done using the 
ChondroChip™ and the Afiymetrix Chip. Samples are clustered and marked as representing 
patients who have asthma or control individuals. The number of hybridizations profiles 
determined for patients with asthma and controls are shown. Various experiments were 
performed as outlined above, and analyzed using either the Wilcox Mann Whitney rank sum 
test, or other statistical tests as described herein. Those genes identified with a p value of < 
0.05 as between the patients with asthma as compared with patients without astimia using the 
ChondroChip™ are shown in Table IAD. Those genes identified with a p value of < 0.05 as 
between the patients with asthma as compared with patients without asthma using the 
Affymetrix® platform are shown in Table 1 AE. 

Classification or class prediction of a test sample from an unknown patient in order to 
diagnose said individual with asthma can be done usmg the differentially expressed genes as 
shown in Table IAD and Table 1 AE in combination with well known statistical algorithms 
for class prediction as would be understood by a person skilled in the art and is described 
herein. Commercially available programs such as those provided by Silicon Genetics (e.g, 
GeneSpring'T^ for q^ss Prediction are also available. 



Hypertension 

This example demonstrates the use of the claimed invention to identify biomarkers of 
hypertension and use of same. 

As used herein, "hypertension" is defined as high blood pressure or elevated arterial pressure. 
Patients identified with hypertension herein include persons who haye an increased risk of 
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developing a morbid cardiovascular event and/or persons who benefit from medical therapy 
designed to treat hypertension. Patients identified with hypertension also can include persons 
having systolic blood pressure of >1 30 mm Hg or a diastoUc blood pressure of >90 mm Hg or 
a person takes antihypertensive medication. Whole blood samples were taken from patients 

S who were diagnosed with hypertension as defined her^. In each case, the diagnosis of 
hypertension was corroborated by a skilled Board certified physician. Figure S shows a 
diagrammatic representation of RNA expression profiJesRNA expression profiles of Whole 
blood samples from individuals who were identified as having hypertension as described 
herein as compared with RNA expression profilesRNA expression profiles firom non 

10 hypertensive individuals and normal individuals. Expression profiles were generated using 
GeneSpring™ software analysis as described herein. Each column represents the 
hybridization pattem resulting &om a single individual. Non hypertensive individuals 
presented without hypertension but may have presented with other ndedical conditions and 
may be under various treatment regimes. Normal individuals presented without any known 

15 conditions. Hybridizations to create said RNA expression profilesRNA expression profiles 
were done usmg a 15K Chondrogene Microarray Chip (ChondroChip™) as described herein. 
Samples are clustered and marked as representing patients who have hypertension or control 
individuals. The number of hybridizations profiles determined for patients with 
hypertension, without hypertension or who are controls are shown in Figure 5. Various 

20 experiments were performed as outlined above, and analyzed using either the Wilcox Mann 
Whitney rank smn test, or other statistical tests as described herein, and those genes identified 
with a p value of < 0.05 as between the patients with hypertension as compared with patients 
without hypertension are shown in Table IE. Table 1 AG shows those genes identified with a 
p value of < 0.05 as between the patients with hypertension as compared with patients 

25 without hypertension fixjm gene expressions profiles generated by analogous experiments 
using the Affynetrix® GeneChip®. 

Classification or class prediction of a test sample from an unknown patient in order to 
diagnose said individual with hypertension can be done using the differentially expressed 
genes as shown in Table IE and 1 AG in combination with well known statistical algorithms 
30 for class prediction as would be understood by a person skilled in the art and is described 
herein. Commercially available programs such as those provided by Silicon Genetics (e.g. 
GeneSpring™) for Class Prediction are also available. 
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Obesity 

This example dranoxistrates the iise of the claimed invention to identify biomaikers of 
obesity and use of same. 

As used herein, "obesity" is defined as an excess of adipose tissue that imparts a 
health risk. Obesity is assessed in terms of height and weight in the relevance of age. 
Patients who are considered obese include, but axe not limited to, patients having a body mass 
index or BMI ((defined as body weight in kg divided by (height in meters/) greater than or 
equal to 30.0. Patients having obesity as defined herein are those with a BMI of greater than 
or equal to 30.0. Whole blood samples were taken Smm patients who were diagnosed with 
obesity as defined herein. In each case, the diagnosis of obesity was conoborated by a 
skilled Board certified physician. Figure 6 shows a diagrammatic representation of RNA 
expression profilesRNA expression profiles of Whole blood samples &om individuals who 
were identified as having obesity as described herein as compared with RNA expression 
profilesRNA expression profiles &om non obese individuals. KNA expression profilesRNA 
expression profiles were generated using GeneSpring™ software analysis as described 
herein. Hybridizations to create said KNA expression profilesRNA expression profiles 
were done using the 15K Chondrogene Micioarray Chip (ChondroChip™) as described 
herein. Samples are clustered and marked as representing patients who have obesity, those 
who are not obese, and normal individuals. The number of hybridizations profiles 
determined for patients with obesity, were not obese, and normal individuals are shown. 
Various experiments were performed as outlined above, and analyzed using either the Wilcox 
Mann Whitney rank sum test, or other statistical tests as described herein, and those genes 
identified with a p value of < 0.05 as between the patients with obesity as compared with 
patients without obesity are shown in Table IF. Table lAH shows those genes identified 
with a p value of < 0.05 as between the patients with obesity as compared with patients 
without obesity from gene expressions profiles generated by analogous experiments using the 
Affymetrix® GeneChip® platforms (U133A and U133 Plus 2.0) . 

Classification or class prediction of a test sample firom an unknown patient in order to 
diagnose said individual with obesity can be done using the differentially expressed genes as 
shown in Table IF in combination with well known statistical algorithms for class prediction 
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as would be understood by a person skilled in the art and is described herein. Connnercially 
available programs such as those provided by SiUcon Genetics (e.g. GeneSpring™) for Class 
Prediction are also available. 



Psoriasis 

This example demonstrates the use of the claimed invention to identify biomarkers of 
psoriasis and use of same. 

As used herein, "psoriasis" is defined as a common multifactorial inherited condition 
characterized by the eruption of circumscribed, discrete and confluent, reddish, silvery-scaled 
maculopapules; the lesions occur predominantly on the elbows, knees, scalp, and trunk, and 
microscopically show characteristic parakeratosis and elongation of rete ridges with 
shortening of epidermal keratinocyte transit time due to decreased cyclic guanosine 
monophosphate, according to Stedman's Online Medical Dictionary, 27th Edition. Whole 
blood samples were taken ftom patients v/bo were diagnosed with psoriasis as defined herein. 
In each case, the diagnosis of psoriasis was corroborated by a skilled Board certified 
physician. KNA expression piofilesRNA expression profiles of Whole blood samples &om 
individuals who were idaitified as having psoriasis as opposed to not havmg psoriasis as 
described herein were generated usmg GeneSpring™ software analysis as described herein. 
Hybridizations to create said RNA expression profilesKNA expression profiles were done 
using the Af^ehix® GeneChip® platforms (U133A and U133 Plus 2.0) as described 
herein (data not shown). . Various experiments were performed as outlined above, and 
analyzed using either the Wilcox Mann Whitney rank sum test, or other statistical tests as 
described herein, and those genes identified with a p value of < 0,05 as between the patiaits 
with psoriasis as compared with patients without psoriasis are shown in Table 5A. 

Classification or class prediction of a test sample firom an unknown patient in order to 
diagaose said individual with psoriasis can be done using the differentially eJ^ressed genes 
as shown in Table 5A in combination witii well known statistical algorithms fi>r class 
prediction as would be understood by a person skilled in the art and is described herein. 
Commercially available programs such as those provided by Silicon Genetics (e.g. 
GeneSpring™) for Class Prediction are also available. 
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Thyroid Disorder 

This example demonstrates the use of the claimed invention to identify biomarkers of 
5 thyroid disorder and use of same. 

As used herein, '*thyroid disorder" is defined as an overproduction of thyroid hormone 
(hyperthyroidism) , underproduction of thyroid hormone (hypothyroidism), benign 
(noncancerous) thyroid disease, and thyroid cancer. Thyroid disorders include Anaplastic 
carcinoma of the thyroid, Chronis thyroiditis (Hashimoto's disease), colloid nodular goiter, 

10 hyperthyroidism, hyperpituitarism, hypothyridism-primary, hypothyridism-secondary, 
medullary thyroid carcinoma, painless (silent) thyroiditis, papillary carcinoa of the thyroid, 
subacute thyroiditis, thyroid cancer and congenital goiter, according to MEDLINE plus 
Illustrated Medical Encyclopedia . Whole blood samples were taken from patients who were 
diagnosed with a thyroid disorder as defined herein. In each case, the diagnosis of a thyroid 

15 disorder was corroborated by a skilled Board certified physician. RNA expression 
profilesRNA expression profiles of Whole blood samples from individuals who were 
identified as having a thyroid disorder as described herein were generated using 
GeneSpring™ software analysis as described herein. Hybridizations to create said RNA 
expression profilesRNA expression profiles were done using the Afjfymetrix® GeneChip® 

20 platforms (U133A and U133 Plus 2.0) as described herein (data not shown). Various 
experiments were performed as outlined above, and analyzed using either the Wilcox Maim 
Whitney rank sum test, or other statistical tests as described herein, and those genes identified 
with a p value of < 0.05 as between the patients with a thyroid disorder as compared with 
patients without a thyroid disorder are shown in Table 5B. 

25 Classification or class prediction of a test sample from an unknown patient in order to 

diagnose said individual with thyroid disorder can be done using the diflFerentially expressed 
genes as shown in Table 5B in combmation with well known statistical algorithms for class 
prediction as would be understood by a person skilled in the art and is described herein. 
Commercially available programs such as those provided by Silicon Genetics (e.g. 

30 GeneSpring^'^ for Class Prediction are also available. 
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Irritable Bowel Syndrome 

This example demonstrates the use of the claimed invention to identify biomarkers of 
irritable bowel syndrome and use of same. 

As used herein, "irritable bowel syndrome" is defined as a common gastrointestinal disorder 

5 involving an abnormal condition of gut contractions (motility) characterized by abdominal 
pain, bloating, mucous in stools, and irregular bowel habits with altemating diarrhea and 
constipation, symptoms that tend to be chronic and to wax and wane over the years, 
according to MedicineNet, Inc., an online, healthcare media publishing company. Whole 
blood samples were taken from patients who were diagnosed with irratble bowel syndrome as 

10 defined herein. In each case, the diagnosis of irratble bowel syndrome was corroborated by 
a skilled Board certified physician. RNA expression profilesRNA expression profiles of 
Whole blood samples from individuals who were generated using GeneSpring™ software 
analj^is as described herein. Hybridizations to create said RNA expression profilesRNA 
expression profiles were done using the Afifymetrix® GeneChip® platforms (U133A and 

15 U133 Plus 2.0) as described herein (data not shown).. Samples are clustered and marked as 
represmting patients who have irratble bowel syndrome or control individuals. Various 
experiments were performed as outlined above, and analyzed using either the Wilcox Maim 
Whitney rank sum test, or other statistical tests as described herein, and those genes identified 
with a p value of < 0.05 as between the patients with irritable bowel syndrome as compared 

20 with patients without irratble bowel syndrome are shown in Table 5C. 

Classification or class prediction of a test sample from an unknown patient in order to 
diagnose said individual with irritable bowel syndrome can be done using the differentially 
expressed genes as shown in Table 5C in combination with well known statistical algorithms 
for class prediction as would be understood by a person skilled in the art and is described 
25 herein. Commercially available programs such as those provided by Silicon Genetics (e.g. 
GeneSpring™) for Class Prediction are also available. 

Osteoporosis 

30 This example demonstrates the use of the claimed invention to identify biomarkers of 

osteoporosis and use of same. 
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As used herein, "osteoporosis" is defined as a reduction in the quantity of bone or atrophy of 
skeletal tissue; an age-related disorder characterized by decreased bone mass and increased 
susceptibility to fractures, according to Stedman's Online Medical Dictionary, 27th Edition. 
Whole blood samples were taken from patients who were diagnosed with osteoporosis 

5 syndrome as defined herein. In each case, the diagnosis of osteoporosis was corroborated by 
a skilled Board certified physician. RNA expression profilesRNA expression profiles of 
Whole blood samples from individuals who were identified as having osteoporosis as 
described herein as compared with RNA expression profilesRNA expression profiles from 
individuals not having osteoporosis, were generated using GeneSpring™ software analysis 

10 as described herein. Hybridizations to create said RNA expression profilesRNA expression 
profiles were done using the Affymetrix® GeneChip® platforms (U133A arid U133 Plus 
2.0) as described herein (data not shown). Samples are clustered and marked as 
representing patients who have osteoporosis or control individuals. Various experiments 
were performed as outlined above, and analyzed using either the Wilcox Mann Whitney rank 

15 sum test or other statistical tests as described herein, and those genes identified with a p value 
of < 0.05 as between the patients with osteoporosis as compared with patients without 
' osteoporosis are shown in Table 5D. 

Migraine Headaches 

20 This example demonstrates the use of the claimed invention to identify biomarkers of 

migraine headaches and use of same. 

As used herein, "Migraine Headaches" is defined as a symptom complex occurring 
periodically and characterized by pain in the head (usually unilateral), vertigo, nausea and 
vomiting, photophobia, and scintillattag appearances of light. Classified as classic migraine, 

25 common migraine, cluster headache, hemiplegic migraine, ophthalmoplegic migraine, and 
ophthalmic migraine, according to Stedman's Online Medical Dictionary, 27th Edition. 
Whole blood samples were taken from patients who were diagnosed with migraiae headaches 
as defined herein. In each case, the diagnosis of migraine headaches was corroborated by a 
skilled Board certified physician. RNA expression profiles of Whole blood samples from 

30 individuals who were identified as having migraine headaches as described herein as 

compared with RNA expression profiles from individuals not having migraine headaches, 
were generated vising GeneSpring™ software analysis as described herein. Hybridizations to 
create said RNA expression profiles were done using the Affymetrix® GeneChip® platforms 
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(U133A and U133 Plus 2.0) as described herein (data not shown). Samples are clustered and 
marked as representing patients who have migraine headaches or control individuals. 
Various experiments were performed as outlined above, and analyzed using either the Wilcox 
Mann Whitney rank sum test or other statistical tests as described herein, and those genes 
5 identified with a p value of < 0.05 as between the patients with migraine headaches as 
compared with patients without migraine headaches are shown in Table 5E. 

Classification or class prediction of a test sample fi:om an unknown patient in order to 
diagnose said individual with migraine headaches can be done usiag the diflFerentially 
expressed genes as shown in Table 5E in combination with well known statistical algorithms 
10 for class prediction as would be understood by a person skilled in the art and is described 
herein. Commercially available programs such as those provided by Silicon Genetics (e.g. 
GeneSpring™) for Class Prediction are also available. 



IS Eczema 

This example demonstrates the use of the claimed invention to identify biomarkers of 
eczema and use of same. 

As used herein, "Eczema" is defmed as mflammatory conditions of the skm, particularly with 
vesiculation in the acute stage, typically erythematous, edematous, papular, and crusting; 

20 followed often by Uchenification and scaling and occasionally by duskiness of the erythema 
and, infrequently, hyperpigmentation; often accompanied by sensations of itching and 
burning; the vesicles form by intraepidermal spongiosis; often hereditary and associated with 
allergic rhinitis and asthma, according to Stedman*s Online Medical Dictionary, 27th Edition. 
Whole blood samples were taken fix)m patients who were diagnosed with eczema as defined 

25 herein. In each case, the diagnosis of eczema headaches was corroborated by a skilled 
Board certified physician. RNA expression profiles of Whole blood samples firom 
individuals who were identified as having eczema as described herein as compared with 
RNA expression profiles firom individuals not having eczema, were generated using 
GeneSpring™ software analysis as described herein. Hybridizations to create said RNA 

30 expression profiles were done using Afiymetrix® GeneChip® platforms (U133A and U133 
Plus 2.0) as described herein (data not shown). Samples are clustered and marked as 
representing patients who have eczema or control individuals. Various experiments were 
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performed as outlined above, and analyzed using either flie Wilcox Mann Whitney rank sum 
test or other statistical tests as described herein, and those genes identified with a p value of < 
0.05 as between the patients with eczema as compared with patients without eczema are 
shown in Table 5F. 

Classification or class prediction of a test sample from an unknown patient in order to 
diagnose said individual with eczema can be done using the differentially expressed genes as 
shown in Table 5F in combination with well known statistical algorithms for class prediction 
as would be understood by a person skilled in tiie art and is described herein. Commercially 
available programs such as fliose provided by Silicon Genetics (e.g. GeneSpring™) for Class 
Prediction are also available. 



Manic Depression Syndrome 

This example demonstrates the use of the claimed invention to identify biomarkers of 
manic depression syndrome and use of same. 

As used herein, "Manic Depression Syndrome (MDS)" refers to a mood disorder 
characterized by alternating mania and depression. Whole blood samples were taken from 
patients who were diagnosed with manic depression as defined herein. In each case, the 
diagnosis of manic depression was corroborated by a skilled Board certified physician. RNA 
expression profiles of Whole blood samples from individuals who were identified as having 
manic depression as described herein as compared with RNA expression profiles from 
individuals not having manic depression, were generated xising GeneSpring™ software 
analysis as described herein. Hybridizations to create said RNA expression profiles were 
done using Affymetrix® GeneChip® platforais (U133A and UBS Plus 2.0) platforms 
(U133A and U133 Plus 2.0) as described herein (data not shown). Samples are clustered 
and marked as representing patients who have manic depression or control individuals. 
Various experiments were performed as outlmed above, and analyzed using either the Wilcox 
Mann Whitney rank sum test or other statistical tests as described herein, and those genes 
identified with a p value of < 0.05 as between the patients with manic depression syndrome 
as compared with patients without manic depression syndrome are shown in Table 51. 
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Classification or class prediction of a test sample from an unknown patient in order to 
diagnose said individual with manic depression syndrome can be done using the 
differentially expressed genes as shown in Table 51 in combination with well known 
statistical algorithms for class prediction as would be understood by a person skilled in the art 
5 and is described herein. Commercially available programs such as those provided by Sihcon 
Genetics (e.g. GeneSpring™) for Class Prediction are also available. 



Crohn's Colitis 

This example demonstrates the use of the claimed invention to identify biomarkers of 
10 Crohn's Colitis and use of same. 

As used herein, "Crohn's Colitis'* is defined as a chronic granulomatous inflammatory 
disease of unknown etiology, involving any part of the gastrointestinal tract firom mouth to 
anus, but commonly involving the terminal ileum with scarring and thickening of the bowel 
wall; it firequentiy leads to intestinal obstruction and fistula and abscess formation and has a 

15 high rate of recurrence after treatment, according to Borland's Illustrated Medical Dictionary. 
In each case, the diagnosis of Crohn's colitis was corroborated by a skilled Board certified 
physician. KNA expression profiles of Whole blood samples fiom individuals who were 
identified as having Crohn's colitis as described herem as compared witii RNA expression 
profiles fi:om mdividuals not having Crohn's colitis, were generated using GeneSpring™ 

20 software analysis as described herein. Hybridizations to create said KNA expression profiles 
were done usmg flie Affymetrix® GeneChip(S> platforms (U133A and U133 Plus 2.0) 
platforms (U133A and U133 Plus 2.0) as described herem (data not shown).. Samples are 
clustered and marked as representing patients who have Crohn's colitis or control individuals. 
Various experiments were performed as outlined above, and analyzed using either the Wilcox 

25 Mann Whitney rank sum test or other statistical tests as described herein, and those genes 
identified with a p value of < 0.05 as between the patients with Crohn's colitis as compared 
with patients without Crohn's colitis are shown in Table 5J. 

Classification or class prediction of a test sample firom an unknown patient in order to 
diagnose said individual with Crohn's colitis can be done using tiie differentially expressed 
30 genes as shown in Table 5 J in combination with well known statistical algorithms for class 
prediction as would be understood by a person skilled m the art and is described herein. 
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Commercially available programs such as those provided by Silicon Genetics (e.g. 
GeneSpring™) for Class Prediction are also available. 



5 Chronic cholecystitis 

This example demonstrates the use of the claimed invention to identify biomarkers of 
Chronic Cholecystitis and use of same. 

As used herein, "Chronic cholecystitis" is defined as chronic inflammation of the 
gallbladder, usually secondary to lithiasis, with lymphocytic infiltration and fibrosis that may 

10 produce marked thickening of the wall, according to Stedman's Online Medical Dictionary, 
27th Edition. In each case, the diagnosis of chronic cholecystitis was corroborated by a 
skilled Board certified physician. KNA expression profiles of Whole blood samples from 
individuals who were identified as having chronic cholecystitis as described herein as 
compared with RNA expression profiles from individuals not having chronic cholecystitis, 

15 were generated using GeneSpring™ software analysis as described herein. Hybridizations to 
create said RNA expression profiles were done using the Affymetrix® GeneChip® platforms 
(U133A and U133 Plus 2.0) platforms (U133A and U133 Plus 2.0) as described herein (data 
not shown). Samples are clustered and marked as representing patients who have chronic 
cholecystitis or control individuals. Various experiments were performed as outlined above, 

20 and analyzed using either the Wilcox Mann Whitney rank sum test or other statistical tests as 
described herein, and those genes identified with a p value of < 0.05 as between the patients 
with chronic cholecystitis as compared with patients without chronic cholecystitis are shown 
in Table 5K. 

Classification or class prediction of a test sample from an unknown patient in order to 
25 diagnose said individual with chronic cholecystitis can be done using the differentially 

expressed genes as shown in Table 5K in combination with well known statistical algorithms 
for class prediction as would be understood by a person skilled m the art and is described 
herein. Commercially available programs such as those provided by Silicon Genetics (e.g. 
GeneSpringTM) for Class Prediction are also available. 



Cervical Cancer 
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This example demonstrates the use of the claimed invention to identify biomarkers of 
cervical cancer and use of same. 



As used herein, "Cervical Cancer" is defined as cancer of the uterine cervix, the 
portion of the uterus attached to the top of the vagina. Ninety percent of cervical cancers arise 

5 from the flattened or "squamous" cells covering the cervix. Most of the remaining 10% arise 
from the glandular, mucus-secreting cells of the cervical canal leading into the uterus. . In 
each case, the diagnosis of cervical cancer was corroborated by a skilled Board certified 
physician. RNA expression profiles of Whole blood samples from individuals who were 
identified as having cervical cancer as described herein as compared with RNA expression 

10 profiles from individuals not having cervical cancer, were generated using GeneSpring™ 
software analysis as described herein. Hybridizations to create said RNA expression profiles 
were done using Affymetrix® GeneChip® platforms (U133A and U133 Plus 2.0) platforms 
(Ul 33 A and Ul 3 3 Plus 2.0) as described herein (data not shown). Samples are clustered 
and marked as representing patients who have cervical cancer or control individuals. Various 

15 experiments were performed as outlined above, and analyzed using either the Wilcox Mann 
Whitney rank sum test or other statistical tests as described herein, and those genes identified 
with a p value of < 0.05 as between the patients with cervical cancer as compared with 
patients without cervical cancer are shown in Table SM. 

Classification or class prediction of a test sample from an unknown patient in order to 
20 diagnose said individual with cervical cancer can be done using the differentially expressed 
genes as shown in Table SM in combination with well known statistical algorithms for class 
prediction as would be understood by a person skilled in the art and is described herein. 
Commercially available programs such as those provided by Silicon Genetics (e.g. 
GeneSpring™) for Class Prediction are also available. 

25 



Stomach Cancer 

This example demonstrates the use of the claimed invention to identify biomarkers of 
stomach cancer and vise of same. 

30 As used herein, "Stomach Cancer" is defined as are malignacies of the stomach, the 

most common type being adenocarcinoma. Stomach is divided into. Cancer can develop in 
any of five different layers of the stomach. . In each case, the diagnosis of stomach cancer 
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was corroborated by a skilled Board certified physician. RNA expression profiles of Whole 
blood samples from individuals who were identified as having stomach cancer as described 
herein as compared with RNA expression profiles from individuals not having stomach 
cancer, were generated using GeneSpring™ software analysis as described herein. 
Hybridizations to create said RNA expression profiles were done using the Affymetrix® 
GeneChip® platforms (U133A and U133 Plus 2.0) platforms (U1S3A and U133 Plus 2.0) as 
described herein (data not shown). Samples are clustered and marked as representing 
patients who have stomach cancer or control individuals. Various experiments wrare 
performed as outlined above, and analyzed using either the Wilcox Mann Whitney rank sum 
test or other statistical tests as described herein, and those genes identified with a p value of < 
0.05 as between the patients with stomach cancer as compared with patients without stomach 
cancer are shown in Table 5N. 

Classification or class prediction of a test sample from an unknown patient in order to 
diagnose said individual with stomach cancer can be done using the differentially expressed 
genes as shown in Table 5N m combination with well known statistical algorithms for class 
prediction as would be understood by a person skilled in the art and is described herein. 
Commercially available programs such as those provided by Silicon Genetics (e.g. 
GeneSpring™) for Class Prediction are also available. 



Kidney Cancer 

This example demonstrates the use of the claimed invention to identify biomarkers of 
kidney cancer and use of same. 

As used herein, "Kidney Cancer" is defined as are malignacies of the kidney, the most 
common type being renal cell carcinomain each case, the diagnosis of kidney cancer was 
corroborated by a skilled Board certified physician. RNA e>q)ression profiles of Whole blood 
samples from individuals who were identified as having kidney cancer as described herein as 
compared with RNA expression profiles from individuals not having kidney cancer, were 
generated using GeneSpring™ software analysis as described herein. Hybridizations to 
create said RNA expression profiles were done using the Affymetrix® GeneChip® platforms 
(U133A and U133 Plus 2.0) platforms (Ul 33 A and Ul 33 Plus 2.0) as described herein (data 
not shown).. Samples are clustered and marked as representing patients who have stomach 
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cancer or control individuals. Various experiments were performed as outlined above, and 
analyzed using either the Wilcox Mann Whitney rank sum test or other statistical tests as 
described herein, and those genes identified with a p value of < 0.05 as between the patients 
with kidney cancer as compared with patients without kidney canc^ are shown in Table 50. 

5 Classification or class prediction of a test sample firom an unknown patient in order to 

diagnose said individual with kidney cancer can be done using the differentially expressed 
genes as shown in Table 50 in combination with well known statistical algorithms for class 
prediction as woxild be understood by a person skilled in the art and is described herein. 
Commercially available programs such as those provided by Silicon Genetics (e.g. 
10 GeneSpring™) for Class Prediction are also available. 

Testicular Cancer 

This example demonstrates the use of the claimed invention to identify biomarkers of 
15 testicular cancer and use of same. 

As xised herein, "Testicular Cancer" is defined as an abnormal, rapid, and invasive 
growth of cancerous (malignant) cells in the testicles. Id each case, the diagnosis of testicular 
cancer was corroborated by a skilled Board certified physician. RNA expression profiles of 
Whole blood samples firom individuals who were identified as having testicular cancer as 

20 described herein were compared with RNA expression profiles firom individuals not having 
testicular cancer, using GeneSpring™ software analysis as described herein. Hybridizations 
to create said KNA expression profiles were done using the Afifymetrix® GeneChip® 
platforms (U133A and U133 Plus 2.0) platforms (U133A and U133 Plus 2.0) as described 
herein (data not shown). Various experiments were performed as outlined above, and 

25 analyzed using either the Wilcox Mann Whitney rank sum test or otihier statistical tests as 
described herein, and those genes identified with a p value of < 0.05 as between the patients 
with testicular cancer as compared with patients without testicular cancer are shown in 
Table 5P 

Classification or class prediction of a test sample jfrom an unknown patient in order to 
30 diagnose said individual with testicular cancer can be done using the differentially expressed 
genes as shown in Table 5P in combination with well known statistical algorithms for class 
prediction as would be understood by a person skilled in the art and is described herein. 
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Commercially available programs such as those provided by Silicon Genetics (e.g. 
GeneSpring™) for Class Prediction are also available. 



5 Colon Cancer 

This example demonstrates the use of the claimed invention to identify biomarkers of 
colon cancer and use of same. 

As used herein, "Colon Cancer" is defmed as cancer of the colon and includes 
carcinoma, which arises from the Uning of the large intestine, and lymphoma, melanoma, 

10 carcinoid tumors, and sarcomas. In each case, the diagnosis of colon cancer was corroborated 
by a skilled Board certified physician. RNA.expression profiles of Whole blood samples 
from individuals who were identified as having colon cancer as described herein as compared 
with RNA expression profiles from individuals not having colon cancer, were generated 
using GeneSpring™ software analysis as described herein. Hybridizations to create said 

15 RNA expression profiles were done using Affymetrix® GeneChip® platforms (U133A and 
U133 Plus 2.0) platforms (U133A and U133 Plus 2.0) as described herein (data not shown). 
Various experiments were performed as outlined above, and analyzed using either the Wilcox . 
Mann Whitney rank sum test or other statistical tests as described herein, and those genes 
identified with a p value of < 0.05 as between the patients with colon cancer as compared 

20 with patients without colon cancer are shown in Table 5Q. 

Classification or class prediction of a test sample from an unknown patient in order to . 
diagnose said mdividual with colon cancer can be done using the differentially expressed 
genes as shown in Table 5Q in combination with well known statistical algorithms for class 
prediction as would be understood by a person skilled in the art and is described herein. 

25 Commercially available programs such as those provided by Silicon Genetics (e.g. 
GeneSpring™) for Class Prediction are also available. 

Hepatitis B 

30 This example demonstrates the use of the claimed invention to identify biomarkers of 

hepatitis B and use of same. 
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As used herein, " Hepatitis B" is a serious disease caused by hepatitis B virus (HBV) tiiat 
attacks human Uver. The virus can cause lifelong infection, cirrhosis (scarring) of the Uver, 
Uver cancer, Uver failure, and death. HBV is transmitted horizontaUy by blood and blood 
products and sexual transmission. It is also transmitted vertically from motiier to infant in tiie 
perinatal period. . In each case, the diagnosis of hepatitis B was corroborated by a skiUed 
Board certified physician. RNA expression profiles of Whole blood samples from 
individuals who were identified as having hepatitis as described herein as compared witii 
RNA expression profiles firam individuals not having hq)atitis, were generated using 
GeneSpring™ software analysis as described herem. Hybridizations to create said RNA 
expression profiles were done using the Afiymetrix® GeneChip® platforms (U133A and 
U133 Plus 2.0) platforms (U133A and U133 Plus 2.0) as described herein (data not shown). 
Samples are clustered and mariced as representing patients who have hepatitis or conti»l 
individuals. Various experiments were performed as outlined above, and analyzed using 
either the Wilcox Mann Whitney rank sum test or otiier statistical tests as described herein, 
and those genes identified with a p value of < 0.05 as between the patients witii hepatitisas 
compared with patients without hepatitis are shown in Table 5R, 

Classification or class prediction of a test sample from an unknown patient in order to 
diagnose said individual with hepatitis B can be done using tiie differentially expressed 
genes as shown in Table 5R in combination witti well known statistical algorithms for class 
prediction as would be understood by a person skilled in tiie art and is desaibed herein. 
Commercially available programs such as those provided by Silicon Genetics (e.g. 
GeneSpring™) for Class Prediction are also available. 



Pancreatic Cancer 

This exanq)le demonstrates tiie use of the claimed mvention to identify biomarkers of 
pancreatic cancer and use of same. 

As used herein, 'Tancreatic Cancer" is defined as cancer of the colon and includes 
carcinoma, which arises from tiie luring of tiie large intestine, and lymphoma, melanoma, 
carcinoid tumors, and sarcomas. In each case, the diagnosis of pancreatic cancer was 
corroborated by a skilled Board certified physician. RNA expression profiles of Whole blood 
samples from individuals who were identified as having pancreatic cancer as described herein 
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as compared with RNA expression profiles firom individuals not having pancreatic cancer, 
were generated using GeneSpring™ software analysis as described herein. Hybridizations to 
create said RNA expression profiles were done using the AflEymetrix® GeneChip® platforms 
(U133A and U133 Plus 2.0) platforms (U133A and U133 Plus 2.0) as described herein (data 
5 not shown). Various experiments were perfonned as outlined above, and analyzed usmg 
either the Wilcox Maim Whitney rank sum test or other statistical tests as described herein, 
and those genes identified with a p value of < 0.05 as between the patients with pancreatic 
cancer as compared with patients without pancreatic cancer are shown in Table 5S. 

Classification or class prediction of a test sample fi-om an unknown patient in order to 
10 diagnose said individual with pancreatic cancer can be done using the differentially expressed 
genes as shown in Table 5S in combination with well known statistical algorithms for class 
prediction as would be understood by a person skilled in the art and is described herein. 
Commercially available programs such as those provided by Silicon Genetics (e.g. 
GeneSpring™) for Class Prediction are also available. 

15 

Nonalcoholic Steatohepatitis (NASH) 

This example demonstrates the use of the claimed invention to identify biomarkers of 
nonalcoholic steatohepatitis and use of same. 

As used herein, "nonalcoholic steatohepatitis", (NASH) is defined as an inflammatory 
20 disease of the Uver associated with the accumulation of fat in the liver. NASH is of imcertain 
pathogenesis and histologically resembling alcoholic hepatitis, but occurring in nonalcoholic 
patients, most often obese women with non-insulin-dependent diabetes mellitus; clinically it 
is generally asymptomatic or mild, but fibrosis or cirrhosis may result. The diagnosis is 
confirmed by a liver biopsy. In each case, the diagnosis of nonalcoholic steatohepatitis was 
15 corroborated by a skilled Board certified physician. RNA expression profiles of Whole blood 
samples fix)m individuals who were identified as having nonalcoholic steatohq>atitis as 
described herein as compared witii RNA expression profiles firom individuals not having 
nonalcoholic steatohepatitis, were generated using GeneSpring™ software analysis as 
described herein. Hybridizations to create said RNA expression profiles were done using 
JO AlSymetrix® GeneChip® platforms (U133A and U133 Plus 2.0) platforms (U133A and 
U133 Plus 2.0) as described herein (data not shown). Various experiments were 
performed as outlined above, and analyzed using either the Wilcox Mann Whitney rank sum 
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test or other statistical tests as described herein, and those genes identified with a p value of < 
0.05 as between the patients with nonalcoholic steatohepatitis as compared with patients 
without nonalcoholic steatohepatitis are shown in Table SG. 

Classification or class prediction of a test sample firom an unknown patient in order to 
5 diagnose said individual with NASH can be done using the differentially expressed genes as 
shown in Table 5G in combination with well known statistical algorithms for class prediction 
as would be understood by a person skilled in the art and is described herein. Commercially 
available programs such as those provided by Silicon Genetics (e.g. GeneSpring^'^ for Class 
Prediction are also available. 



Alzheimer's Disease 

As used herein, "alzheimer's disease*' refers to a degenerative disease of the central 
nervous system characterized especially by premature senile mental deterioration. In each 

15 case, the diagnosis of alzheimer's disease was corroborated by a skilled Board certified 
physician. RNA expression profiles of Whole blood samples firom individuals who were 
identified as having Alzheimer's Diseaseas described herein as compared with RNA 
expression profiles firom individuals not having alzheimer's disease, were generated using 
GeneSpring™ software analysis as described herein. Hybridizations to create said RNA 

20 expression profiles were done using the Affymetrix(S> GeneChip® platforms (U133A and 
U133 Plus 2.0) platforms (U133A and U133 Plus 2.0) as described herein (data not shown). 
Samples are clustered and marked as representing patients who have alzheimer's disease or 
control individuals. Various experiments were performed as outlined above, and analyzed 
using either the Wilcox Mann Whitney rank sum test or other statistical tests as described 

25 herein, and those genes identified with a p value of < 0.05 as between the patients with 
alzheiraer's disease as compared with patients without alzheimer's disease are shown in 
Table 5H. 

Classification or class prediction of a test sample firom an unknown patient in order to 
diagnose said individual with alzheimer's disease can be done using the differentially 
30 expressed genes as shown in Table 5H in combmation with well known statistical algorithms 
for class prediction as would be understood by a person skilled in the art and is described 
herein. Commercially available programs such as those provided by Silicon Genetics (e.g. 
GeneSpring"^^ for Class Prediction are also available. 
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Heart Faflure 

This example demonstrates the use of the claimed invention to identify biomarkers of 

5 heart failure and use of same. 

As used herein, "heart failure" is defined as an madequacy of the heart so that as a 
pump it fails to maintain the circulation of blood, with the result that congestion and edema 
develop in the tissues; Heart failure is synonymous with congestive heart failure, myocardial 
insufficiency, cardiac insufficiency, cardiac failure, and includes right ventricular failure, 

10 forward heart failure, backward heart failure and left ventricular failure. Resulting clinical 
syndromes include shortness of breath or nonpitting edema, enlarged tender Uver, engorged 
neck veins, and puhnonary rales in various combinations. In each case, the diagnosis of heart 
failure was corroborated by a skilled Board certified physician. RNA expression profiles of 
Whole blood samples fi-om individuals who were identified as having heart failure described 

15 herein as compared with RNA expression profiles firom individuals not having hearat failure, 
were generated using GeneSpring™ software analysis as described herein. Hybridizations to 
create said KN A expression profiles were done using Affymetrix® GeneChip® platforms 
(U133AandU133 Plus 2.0) platforms (U133A and U133 Plus 2.0) as described herein (data 
not shown). Samples are clustered and marked as representing patients who have heart 

20 failure or control individuals. Various experiments were performed as outlined above, and 
analyzed using either the Wilcox Mann Whitney rank sum test or other statistical tests as 
described herein, and those genes identified with a p value of < 0.05 as between the patients 
with heart failure as compared with patients without heart failure are shown in Table 5L. 

Classification or class prediction of a test sample firom an unknown patient in order to 
25 diagnose said individual with heart failure can be done using flie differentially expressed 
genes as shown in Table 5L in combination with well known statistical algorithms for class 
prediction as would be understood by a person skilled in the art and is described herein. 
Commercially available programs such as those provided by Silicon Genetics (e.g. 
GeneSpring™) for Class Prediction are also available. 



Ankylosing Spondylitis 
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This example demonstrates the use of the claimed invention to identify biomarkers of 
ankylosing spondylitis and use of same. 

As used herein "ankylosing spondylitis" refers to a chronic inflammatory disease that 
affects the joints between the vertebrae of the spine, and/or the joints between the spine and 

5 the pelvis and can eventually cause the aflFected vertebrae to fuse or grow together. In each 
case, the diagnosis of heart failure was corroborated by a skilled Board certiiaed physician. 
RNA expression profiles of Whole blood samples firom individuals who were identified as 
having lieart failure described herein as compared with RNA expression profiles from 
individuals not having hearat failure, were generated usmg GeneSpring™ software analysis 

10 as described herein. Hybridizations to create said RNA expression profiles were done using 
the Affymetrix®GeneChip®platfornis(U133A and U133 Plus 2.0) platforms (Ul 33 A and 
U133 Plus 2.0) as described herein (data not shown). Samples are clustered and marked as 
representing patients who have ankylosing spondylitis or control individuals. Various 
experiments were performed as outlined above, and analyzed using either tiie Wilcox Mann 

15 Whitney rank sum test or other statistical tests as described herein, and those genes identified 
with a p value of < 0.05 as between the patients with ankylosing spondylitis as compared 
with patients without ankylosing spondylitis are shown in Table 1 AI. 

Classification or class prediction of a test sample firom an unknown patient in order to 
diagnose said individual with ankylosing spondylitis can be done using the differentially 
20 expressed genes as shown in Table 1 AI in combination with well known statistical algorithms 
for class prediction as would be understood by a person skilled in the art and is described 
herein. Commercially available programs such as those provided by Silicon Genetics (e.g. 
GeneSpring™) for Class Prediction are also available. 

25 

EXAMPLES 

In addition to methods to identify biomaikers associated with a condition, this 
invention also includes methods to identify biomarkers that can identify markers for a 
30 condition in an individual or group of individuals, despite the presence of one or more second 
conditions in the same individual or group of individuals. The invention also includes 
methods to identify biomaikers of a co-morbid condition. The following examples illustrate 
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embodiments of methods comprising individuals presenting with Ostearthritis and various 
second conditions, but the invention is not limited to these sets of examples. 



Osteoarthritis and Hypertension 

5 ChondroChip™ Microarray Data Analysis of RNA expression profiles of Whole 

blood samples from co-morbid individuals having osteoarthritis and hypertension as 
compared with RNA expression profiles from normal individuals. 

This example demonstrates the use of the claimed invention to detect biomarkers of 
patients with osteoarthritis and hypertension or use of same. 

10 Whole blood samples were taken from patients who wwe diagnosed with 

osteoarthritis and hypertension as dejQned herein. RNA expression profiles were thra 
analysed and compared to profiles from patients unaffected by any disease. In each case, the 
diagnosis of osteoarthritis and hypertension was corroborated by a skilled Board certified 
physician . 

15 Total mRNA from whole blood wasisolated from each patient was isolated usmg 

TRIzol® reagent (GIBCO) and fluorescently labeled probes for each blood sample were 
generated as described above. Each probe was denatured and hybridized to a 15K 
Chondrogene Microanay Chip (ChondroChip™) as described herein. Identification of graes 
differentially expressed in Whole blood samples from patients with disease as compared to 

20 healthy patients was detemMued by statistical analysis using the Wilcox Mann Whitney rank 
sum test (Glantz SA. Primer of Biostatistics. 5th ed. New York, USA: McGraw-Hill Medical 
Publishing Division, 2002). 

Figure 1 shows a diagranomatic representation of RNA expression profiles of Whole 
blood samples from individuals having hypertension and osteoarthritis as compared with 

25 RNA expression profiles from normal individuals. Expression profiles were generated using 
GeneSpring™ software analysis as described herein. Each colunm represents the 
hybridization pattern resulting &om a single individual. In this example, hypertensive 
patients also presented with OA, as described herein. Normal individuals have no known 
medical conditions and were not taking any known medication. Hybridizations to create said 

30 RNA expression profiles were done using the ChondroChip™. A dendogram analysis is 
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shown above. Samples are clustered and marked as rq)resenting patients who are 
hypertensive or normal. The indicates those patients who abnormally clustered as either 
hypertensive, or normal despite presenting with the reverse. The number of hybridizations 
profiles determined for either hypertensive patients or normal individuals are shown. 861 
5 differentially expressed genes were identified as being differentially expressed with a p value 
of < 0.05 as between the hypertensive patients and normal individuals. The identity of the 
differentially expressed genes is shown in Table 1 A. 

Classification or class prediction of a test sample as having hypertension and OA or 
being normal can be done using the differentially expressed genes as shown in Table 1 A in 
10 combination with well known statistical algorithms for class prediction as would be 
understood by a person skilled in the art and described herein. Commercially available 
programs such as those provided by Silicon Genetics (e.g. GeneSpring™) for Class 
Predication are also available. 



15 Osteoarthritis and Obesity 

ChondroChip™ Microanay Data Analysis of RNA expression profiles of Whole 
blood samples firom co-morbid individuals to identify biomarkers of osteoarthritis and 
obesityRNA expression profiles. 

This example demonstrates the use of the claimed invention to detect differential gene 
20 expression in Whole blood samples taken fiom patients with obesity and OA as compared to 
Whole blood samples taken fiom healthy patients. 

Whole blood samples were taken from patients who were diagnosed with 
osteoarthritis and obesity as defined herein. RNA expression profiles were then analysed and 
compared to profiles from patients unaffected by any disease. In each case, the diagnosis of 

25 the disease was corroborated by a skilled Board certified physician. Total mRNA from a 
blood taken from each patient was isolated using TRIzol® reagent (GIBCO) and 
fluorescCTtly labeled probes for each blood sample were generated as described above. Each 
probe was denatured and hybridized to a ISK Chondrogene Microarray Chip 
(ChondroChip™) as described herein. Identification of genes dijBferentially expressed in 

30 Whole blood samples from patients with disease as compared to healthy patients was 
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detennined by statistical analysis usiag the Wilcox Mann Whitney rank sum test (Glantz S A. 
Primer of Biostatistics. 5th ed. New York, USA: McGraw-Hill Medical Publishing Division, 
2002). 

Figure 2 shows a diagrammatic representation of RNA expression profiles of Whole 
5 blood samples jfrom individuals who were identified as obese as described herein as 

compared with RNA expression profiles fi'om normal individuals. Expression profiles were 
generated using GeneSpring™ software analysis as described herein. Each column 
represents the hybridization pattern resulting fi'om a single individual. In this example, 
obese patients also presented with OA, as described herein. Normal individuals have no 
10 known medical conditions and were not taking any known medication. Hybridizations to 
create said SNA expression profiles were done using the ChondroChip™. A dendogram 
analysis is shown above. Samples are clustered and marked as representing patients who 
are obese or normal. The ^^^^ indicates those patients vAio abnormally clustered as either 
obese or normal despite presenting with the reverse. The number of hybridization profiles 
IS determined for obese patients with OA and normal individuals are shown. 913 genes were 
identified as being differentially expressed with a p value of < 0.05 as between the obese 
patients with OA and normal individuals is noted. The identity of the differentially 
expressed genes is shown in Table IB. 

Classification or class prediction of a test sample as either having obesity and OA or 
20 being normal can be done using the differentially expressed genes as shown in Table IB in 
combination with well known statistical algorithms as would be understood by a person 
skilled in the art and described herein. Commercially available progran[is such as those 
provided by Silicon Genetics (e.g. GeneSpring™) for Class Predication are also available. 

25 Osteoarthritis and Allergies 

ChondroChip™ Microarray Data Analysis of RNA expression profiles of Whole 
blood samples firom co-morbid individuals having osteoarthritis and allergies as compared 
with RNA expression profiles &om normal individuals. 

This example demonstrates the use of the claimed invention to detect differential 
30 biomarkers of osteoarthritis and allergies. 
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Whole blood samples were taken from patients who were diagnosed with 
osteoarthritis and allergies as defined herein. These patients are classified as presenting with 
co-morbidity, or multiple disease states. RNA expression profiles were then analysed and 
compared to profiles from patients unaffected by any disease, hi each case, the diagnosis of 
5 osteoarthritis and allergies, was corroborated by a skilled Board certified physician. 

Total mRNA from blood taken from each patient was isolated using TRIzol® reagent 
(GIBCO) and fluorescently labeled probes for each blood sample were generated as described 
above. Each probe was denatured and hybridized to a ISK Chondrogene Microairay Chip 
(ChondroChip™) as described Herein. Identification of genes differentially expressed m 
10 Whole blood samples from patients with osteoarthritis and allergies as compared to healthy 
patients was determined by statistical analysis using the Wilcox Mann Whitney rank sum test 
(Glantz SA. Primer of Biostatistics. 5th ed. New York, USA: McGraw-Hill Medical 
Publishing Division, 2002). 

Figure 3 shows a diagrammatic representation of RNA expression profiles of Whole 
15 blood samples from individuals who were identified as having allergies as described herein 
as compared with RNA expression profiles from normal individuals. Expression profiles 
were generated using GeneSpring™ software analysis as described herein. Each colunm 
represmts the hybridization pattern resulting &om a single individual, hi this example, 
patients with allergies also presented with OA, as described hereiiL Normal individuals had 
20 no known medical conditions and were not taking any known medication. Hybridizations 
to create said RNA expression profiles were done using the ChondroChip™ . A 
dendogram analysis is shown above. Samples are clustered and marked as representing 
patients who are obese or normal. The indicates fliose patients who abnormally 
clustered as either having allergies or being normal despite presenting with the reverse. The 
25 number of hybridizations profiles detenniaed for patients with allergies and normal 

individuals are shown. 633 genes were identified as being differentially expressed with a p 
value of < 0.05 as between patients with allergies and normal mdividuals is noted. The 
identity of the differentially expressed genes is shown in Table IC. 

Classification or class prediction of a test sample to determme whether said mdividual 
30 has allergies and OA or is normal can be doneusing the differentially expressed genes as 
shown in Table IC in combination with well known statistical algorithms as would be 
understood by a person skilled in the art and described herein. Commercially available 
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programs such as those provided by Silicon Genetics (e-g. GeneSpring™) for Class 
Predication are also available. 



Osteoarthritis and Systemic Steroids 

ChondroChip™ Microarray Data Analysis of KNA expression profiles of Whole 
5 blood samples from co-moibid individuals having osteoarthritis and subject to systemic 
steroids as compared with RNA expression profiles from normal mdividuals 

This example demonstrates the use of the claimed invention to detect biomarkers in 
blood of patiaits subject to systemic steroids and having osteoarthritis. 

As used herein, "systemic steroids" indicates a person subjected to artificial levels of 
10 steroids as a result of medical intervention. Such systemic steroids include birth control pills, 
prednisone, and hormones as a result of hormone replacement treatment. A person identified 
as having systemic steroids is one who is on one or more of the following of the above 
treatment regimes. 

Whole blood samples were taken from patients who were diagnosed with 
15 osteoarthritis and subject to systemic steroids as defined herein. RNA expression profiles 
were then analysed and compared to profiles from patients imaffected by any disease. In each 
case, the diagnosis of osteoarthritis and systemic steroids was corroborated by a skilled Board 
certified physician. 

Total mRNA from blood taken from each patient was isolated using TRIzol® reagent ' 
20 (GIBCO) and fluorescently labeled probes for each blood sample were generated as described 
above. Each probe was denatured and hybridized to the 15K Chondrogene Microarray Chip 
(ChondroChip™) as described herein. Identification of genes differentially expressed in 
Whole blood samples from patients with osteoarthritis and subject to systemic steroids as 
compared to healthy patients was determined by statistical analysis using the Wilcox Mann 
25 Whitney rank sum test (Glantz SA. Primer of Biostatistics. 5th ed. New York, USA: 
McGraw-Hill Medical Publishing Division, 2002). 

Figure 4 shows a diagrammatic representation of RNA expression profiles of Whole 
blood samples from individuals who were subject to systemic steroids as described herein 
as compared with RNA expression profiles from normal individuals. Expression profiles 
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were generated using GeneSpring™ software analysis as described herein. Each column 
Represents the hybridization pattern resulting from a smgle individual. In this example, 
patients takmg systemic steroids also presented with OA, as described herein. Nomal 
individuals have no known medical conditions and were not taking any known medication. 

5 Hybridizations to create said RNA expression prolBles were done using the ChondroChip™ 
. (A dendogram analysis is shown above. Samples are clustered and marked as 
representing patients who are taking systemic steroids or normal.' The "*" mdicates those 
patients who abnormally clustered as either systanaic steroids or normal despite presenting 
with the reverse. The number of hybridizations profiles detennined for patients with 

10 systemic steroids and normal individuals are shown. 605 genes were identified as bemg 
differentially expressed with a p value of < 0.05 as between patients with systemic steroids 
and normal individuals is noted. The identity of the differentially expressed genes is shown 
in Table ID. 

Classification or class prediction of a test sample from a patient as indicating said 
15 patient takes systemic steroids and has OA or as being normal can be done using the 
differentially expressed genes as shown m Table ID in combination with well known 
statistical algorithms for class prediction as would be understood by a person skilled in the art 
and is described herein. Commercially available programs such as those provided by Silicon 
Genetics (e.g. GeneSpring™) for Class Predication are also available. 

20 

Osteoarthritis and Hypertension Compared with Osteoarthritis Only 

ChondroChip™ Microarray Data Analysis of RNA expression profiles of Whole 
blood samples from individuals having osteoarthritis and hypertension as compared with 
RNA expression profiles from patients having osteoarthritis only. 

25 This example demonstrates the use of the claimed invention to identify biomarkersin 

Whole blood samples which are specific to hypertension by comparing gene expression in 
blood from co-morbid patients with osteoarthritis and hypertension to Whole blood samples 
taken from OA patients only. 

Whole blood samples were taken from patients who were diagnosed with 
30 osteoarthritis and hypertension as defined herein. RNA expression profiles were then 



112 



wo 2004/1 12589 PCTAJS2004/020836 
analysed and compared to profiles fi-om patients having OA only. In each case, the diagnosis 
of osteoarthritis and/or hypertension was corroborated by a skilled Board certified physician . 



Total mKNA fix)m blood taken fi-om each patient was isolated using TRIzol® reagent 
(GIBCO) and fluorescently labeled probes for each blood sample were generated as described 
5 above. Bach probe was denatured and hybridized to a 15K Chondrogene Microairay Chip 
(ChondroChip™) as described herein. Identification of genes differentially expressed in 
Whole blood samples firom patients with disease as compared to OA patients only was 
determined by statistical analysis using the Wilcox Mann Whitney rank sum test (Glantz SA. 
Primer of Biostatistics. 5th ed. New York, USA: McGraw-Hill Medical Publishing Division, 
10 2002). . 

Expression profiles were generated using GeneSpring™ software analysis as 
described herein (data not shown). The gene list generated firom this analysis was identified 
and those genes previously identified in Table 1 A removed so as to identify those genes 
which are unique to hypertension. 790 dijfferentially expressed genes were identified as being 

15 differentially expressed with a p value of < 0.05 as between the OA and hypertensive patients 
when compared with OA individuals. 577 genes were identified as unique to hypertension. 
The identity of these differentially expressed genes are shown in Table IG. A gene list is 
also provided of the 213 genes which were found in common as between those genes 
identified in Table 1 A and genes differentially expressed in Whole blood samples taken fix)m 

20 patients with osteoarthritis and hypertension as compared to Whole blood samples taken firom 
OA patients only. The identity of these intersecting differentially expressed genes is shoAvn 
in Table IH and a venn diagram showing the relationship between the various groups of gene 
lists is found in Figure 7. 

Classification or class prediction of a test sample as having hypertension or not 
15 having hypertension can be done using the differentially expressed genes as shown in Table 
IG as the predictor genes ia combination with well known statistical algorithms as would be 
imderstood by a person skilled in the art and described herein. Commercially available 
programs such as those provided by Silicon Genetics (e.g. GeneSpring^w) for Class 
Predication are also available. Classification of individuals as having both OA and 
iO hypertension using the genes in Table IH can also be performed. 

Osteoarthritis and Obesity Compared with Osteoarthritis Only 
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ChondroChip™ Microarray Data Analysis of RNA expression profiles of 
Whole blood samples from co-morbid individuals having osteoarthritis and obesity as 
compared with RNA expression profiles from patients having osteoarthritis only. 

This example demonstrates the use of the claimed invention to identify biomarkers in 
5 Whole blood samples which are specific to obesity by comparing gene expression in blood 
from co-morbid patients with osteoarthritis and obesity to Whole blood samples taken from 
OA patients only. 

Whole blood samples were taken bom patients who were diagnosed with 
osteoarthritis and obesity as defined herein. RNA expression profiles were then analysed and 
10 compared to profiles from patients affected by OA only. 

In each case, the diagnosis of the disease was corroborated by a skilled Board 
certified physician. Total mRNA firom blood taken bom each patient was isolated using 
TRIzol® reagent (GIBCO) and fluorescentiy labeled probes for each blood sample were 
generated as described above. Each probe was denatured and hybridized to a ISK 
15 Chondrogene Microarray Chip (ChondroChip™) as described herein. Identification of genes 
diflFerentially expressed in Whole blood samples from patients with obesity and OA as 
compared to OA patients only was determined by statistical analysis using the Wilcox Maim 
Whitney rank sum test (Glantz SA. Primer of Biostatistics. 5th ed. New York, USA: 
McGraw-Hill Medical PubUshing Division, 2002). 

20 Expression profiles were generated using GeneSpring™ software analysis as 

described herein (data not shown). 671 gmes were identified as being differentially 
expressed with a p value of < 0.05 as between the obese patients with OA and those patients 
with only OA. Those genes previously identified in Table IB were removed so as to identify 
those genes which are unique to obesity. The identity of these 519 genes unique to obesity 

25 are shown in Table 11. A gene list is also provided of those genes which were found m 

common as'^between those genes identified in Table IB and genes differentially expressed in 
Whole blood samples taken from patients with osteoarthritis and obesity as compared to 
Whole blood samples taken from OA patients only. 152 genes are shown in Table IJ. A 
Venn diagram showing the relationship between the various groups of gene lists is found in 

30 Figure 8. 
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Classification or class prediction of a test sample as having obesity or not having obesity 
can be done using the differentially expressed genes as shown in Table II as the predictor 
genes in combination with well known statistical algorithms as would be understood by a 
person skilled in the art and described herein. Commercially available programs such as 
5 those provided by Silicon Genetics (e.g. GeneSpring™) for Class Predication are also 
available. Classification of individuals as having both OA and obesity using the genes in 
Table 1 J can also be performed.Osteoarthritis and AHergies Compared wiA 
Osteoarthritis Only 

ChondroChip™ Microarray Data Analysis of KNA expression profiles of Whole 
10 . blood samples from individuals having osteoarthritis (OA) and allergies as compared with 
KNA e3q)ression profiles from individuals with OA only. 

This example demonstrates the use of the claimed invention to identify biomarkers in 
Whole blood samples which are specific to allergies by comparing gene expression ui blood 
fi-om co-morbid patients with osteoarthritis and allergies to Whole blood samples taken from 
1 5 OA patients only. 

Whole blood samples were taken fi-om patients who were diagnosed with 
osteoarthritis and allergies as defined herein. RNA expression profiles were then analysed 
and compared to profiles from patients affected by OA only. In each case, the diagnosis of 
osteoarthritis and allergies was corroborated by a skilled Board certified physician, 

20 Total mRNA from blood taken from each patient was isolated using TRIzol® reagent 

(GIBCO) and fluorescently labeled probes for each blood sample were generated as described 
above. Each probe was denatured and hybridized to a 15K Chondrogene Microarray Chip 
(ChondroChip™) as described herein. Identification of genes differentially expressed in 
Whole blood samples from patients with osteoarthritis and allergies as compared to OA 

25 patients only was determined by statistical analysis using the Wilcox Mann Whitney rank 
sum test (Glantz SA. Primer of Biostatistics. 5th ed. New York, USA: McGraw-Hill Medical 
PubUshing Division, 2002). 

Expression profiles were generated using GeneSpring™ software analysis as 
described herein (data not shown). 498 genes were identified as being differentially 
30 expressed with a p value of < 0.05 as between patients with allergies and OA as compared 
with patients with OA only. Of the 498 genes identified, those genes previously identified in 
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Table IC were removed so as to identify those genes which are unique to allergies. 257 
diflFerentially expressed genes were idmtified as being as unique to allergies. The identity of 
these differentially expressed genes are shown in Table IK. A gene list is also provided of 
the 241 genes which were found in conunon as between tiiose genes identified in Table 3C 
5 and genes differentially expressed in Whole blood samples taken from patients with 
osteoarthritis and allergies as compared to Whole blood samples taken from OA patients 
only-. The identity of tiiese intersecting differentially expressed genes is shown in Table IL 
and a veim diagram showing the relationship between the various groups of gene lists is 
foimd in Figure 9. 

10 Classification or class prediction of a test sample as having allergies or not having 

allergies can be done using the differentially expressed genes as showii in Table IK as the 
predictor genes in combination with well known statistical algorithms as would be 
imderstood by a person skilled in the art and described herein. Commercially available 
programs such as those provided by Silicon Genetics (e.g. GeneSpring™) for Class . 

15 Predication are also available. Classification of individuals as havmg both OA and allergies 
using the genes in Table IL can also be performed. 

RNA expression profilesRNA expression profilesRNA expression profilesRNA expression 
profilesRNA expression profilesRNA expression profilesOsteoarthritis and Systemic 
20 Steroids Compared with Osteoarthritis Only 

ChondroChip™ Microarray Data Analysis of RNA expression profiles of Whole 
blood samples from co-morbid individuals having osteoarthritis and subject to systemic 
steroids as compared with RNA expression profiles from with osteoarthritis only. 

This example demonstrates the use of the claimed invention to identify biomarkers in 
25 Whole blood samples which are specific to systemic steroids by comparing gene expression 
in blood from co-moibid patients with osteoarthritis and systemic steroids to Whole blood 
samples taken from OA patients only. 

Whole blood samples were taken from patients who were diagnosed with 
osteoarthritis and subject to systemic steroids as defined herein. RNA expression profiles 
30 were then analysed and compared to profiles Scorn patients having OA only. In each case, the 
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diagnosis of osteoarthritis and systemic steroids was corroborated by a skilled Board certified 
physician. 



Total mRNA from blood was taken from each patient was isolated using TRIzol® 
reagent (GBBCO) and fluorescently labeled probes for each blood sample were generated as 

5 described above. Each probe was denatured and hybridized to the 1 5K Chondrogene 

Microarray Chip (ChondroChip™) as described herein. Identification of genes differentially 
expressed in Whole blood samples from patients with osteoarthritis and subject to systemic 
steroids as compared patients with OA only was detemained by statistical analysis using the 
Wilcox Mann Whitney rank sum test (Glantz SA. Primer of Biostatistics. 5th ed. New York, 

10 USA: McGraw-Hill Medical Publishing Division, 2002). 

Expression profiles were generated using GeneSpring™ software analysis as 
described herein (data hot shown). 553 genes were identified as being differentially 
expressed with ap value of < 0.05 as between patients taking systemic steroids and OA as 
compared with patients with OA only. Of the 553 genes identified, those genes previously 

15 identified in Table ID were removed so as to identify those genes which are unique to 
systemic steroids. 362 differentially expressed genes were identified as being as unique to 
systemic steroids. The identity of these differentially expressed genes are shown in Table 
IM. A gene list is also provided of the 191 genes which were found in common as between 
those genes identified in Table 3D and genes diflferentially expressed in Whole blood samples 

20 taken fix>m patients with osteoarthritis and systemic steroids as compared to Whole blood 
samples taken from OA patients only. The identity of these mtersecting differentially 
e:a)ressed genes is shown in Table IN and a venn diagram showing the relationship between 
the various groups of gene Usts is found in Figure 10. 

Classification or class prediction of a test sample of an individual as either taking 
25 systemic steroids or not taking systemic steroids can be done using the differentially 
expressed genes as shown in Table IM as the predictor genes in combination with well 
known statistical algorithms as would be understood by a person skilled in the art and 
described herein. Commercially available programs such as those provided by Silicon 
Genetics (e.g. GeneSpring™) for Class Predication are also available. Classification of 
30 individuals as having both OA and taking systemic steroids using the genes in Table IN can 
also be performed. 
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Osteoarthritis and Systemic Steroids Compared with Nonnal so as to Differeutiate 
Between Types of Systemic Steroids. 

ChondroChip™ Microairay Data Analysis of RNA expression profiles of Whole 
5 blood samples from co-morbid individuals having osteoarthritis and subject to systemic 
steroids as compared witti KNA expression profiles from normal individuals. 

This example demonstrates the use of the claimed invention to identify biomarkers in 
Whole blood samples which are specific to individual types of systemic steroids by 
comparing gene expression in blood from co-morbid patients with osteoarthritis and either 
10 • on prednisone, birth control pills or taking hormones to Whole blood samples taken from OA 
patients only. 

As used herein, "systemic steroids" indicates a person subjected to artificial levels of 
steroids as a result of medical intervention. Such systemic steroids include birth control pills, 
prednisone, and hormones as a result of hormone rq>lacement treatment. A person identified 
15 as having systemic steroids is one who is on one or more of the following of the above 
treatment regimes. 

Whole blood samples were taken from patients who were diagnosed with 
osteoarthritis and subject to systemic steroids as defined herein. RNA expression profiles 
were then analysed and compared as between the systemic steroids as compared to profiles 
20 &om patients unaffected by any disease. In each case, the diagnosis of osteoarthritis and 
systemic steroids was corroborated by a skilled Board certified physician. 

Total mRNA from blood taken from each patient was isolated using TRIzol® reagent 
(GBCO) and fluorescently labeled probes for each blood sample were generated as described 
above. Each probe was denatured and hybridized to the 15K Chondrogene Microarray Chip 
25 (ChondroChip™) as described herein. Identification of genes differentially expressed in 
Whole blood samples from patients with osteoarthritis and subject to systemic steroids as 
compared to healthy patients was determined by statistical analysis using the Wilcox Mann 
Whitney rank sum test (Glantz SA. Primer of Biostatistics. 5th ed. New York, USA: 
McGraw-Hill Medical Publishing Division, 2002). 
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Figure 1 1 shows a diagrammatic representation of RNA expression profiles of 
Whole blood samples from individuals who were subject to either birth control, prednisone, 
or hormone replacement therapy as described herein as compared with RNA expression 
profiles firom normal individuals. Expression profiles were generated using GeneSpring™ 
5 software analysis as described herein. Each column rq)resents the hybridization pattem 
resulting fix>m a single individual. In this example, patients taking with each of the 
systemic steroids also presented with OA, as described herein. Normal individuals have no 
known medical conditions and were not taking any known medication. Hybridizations to 
create said RNA expression profiles were done using the ChondroChip™. A dendogram 

10 analysis is shown above. Samples are clustered and marked as representing patients who 
are taking birth control, prednisone, hormone replacement therapy or normal. The 
indicates those patients who abnormally clustered. The number of hybridizations profiles 
determined for patients with birth control, prednisone, hormone replacement therapy or 
normal individuals are shown. 396 genes were identified as being differentially expressed 

15 with a p value of < 0.05 as between patients with systemic steroids and normal individuals 
is noted. The identity of the differentially expressed genes is shown in Table lO. 

Classification or class prediction of a test sample firom a patient as indicating said 
patient takes systemic steroids and has OA or as being normal can be done using the 
diflFerentially expressed genes as shown in Table lO in combination with well known 
20 statistical algorithms for class prediction as would be understood by a person skilled in the art 
and is described herein. Commercially available prograntis such as those provided by Silicon 
Genetics (e.g. GeneSpring™) for Class Predication are also available. 

Osteoarthritis and Asthma Compared with Osteoarthritis Only 

ChondroChip™ Microarray Data Analysis of RNA expression profiles of Whole 
25 blood samples firom individuals having osteoarthritis (OA) and asthma as compared with 
RNA expression profiles from individuals with OA only. 

This example demonstrates the use of the claimed invention to identify biomarkers in 
Whole blood samples which are specific to asthma. 

Whole blood samples were taken fiiom patients who were diagnosed with 
30 osteoarthritis and asthma as defined herein. RNA expression profiles were then analysed and 
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compared to profiles firom patients affected by asthma only. In each case, the diagnosis of 
osteoarthritis and asthma was corroborated by a skilled Board certified physician. 



Total mRNA firom blood was taken from each patient was isolated using TRIzol® 
reagent (GIBCO) and fiuorescently labeled probes for each blood sample were generated as 

5 described above. Each probe was denatured and hybridized to a 1 5K Chondrogene 

Microarray Chip (ChondroChip™) as described herein. Identification of genes differentially 
expressed in Whole blood samples firom patients with osteoarthritis and asthma as compared 
to OA patients only was determined by statistical analysis using the Wilcox Mann Whitaey 
rank sum test (Glantz SA. Primer of Biostatistics. 5th ed. New York, USA: McGraw-ffiU 

10 Medical Publishing Division, 2002). Figure 24 shows a diagranmiatic representation of RNA 
expression profiles of Whole blood samples fix)m individuals who had asthma and 
osteoarthritis as described herein as compared with RNA expression profiles firom 
osteoarthiitic individuals. Expression profiles were generated using GeneSpring™ software 
analysis as described herein. Each column represents the hybridization pattern resulting 6om 

15 a single individual. Hybridizations to create said RNA expression profiles were done using 
the ChondroChip™ and tiie Afiymetrix Chip. (A dendogram analysis is shown above). 
Samples are clustered and marked as representing patients who have asthma and OA or fliose 
patients who have just OA. The number of hybridizations profiles determined for patients 
with asthma and patients without asthma are shown: Various experiments were performed 

20 using the ChondroChip™as outlined above and analyzed using either the Wilcox Mann 

Whitney rank sum test, or other statistical tests as described herein, and those genes identified 
with a p value of < 0.05 as between the patients with asthma and OA and patients with just 
OA are shown in Table 1 AC. Additionally experiments were performed using the 
Affymetrix® GeneChip® platforms (U133A and U133 Plus 2.0) platforms (U133A and 

25 U133 Plus 2.0) as described herein (data not shown) using either the Wilcox Mann Whitney 
rank sum test, or other statistical tests as described herein, and those genes identified witii a p 
value of < 0.05 as between the patients witii asthma and without asthma are shown in Table 
IAD. 

EXAMPLE 4 

30 In addition to methods to identify biomarkers associated with a specific disease or 

condition, this invention also includes methods to identify biomarkers that distinguish 
between different stages of the condition. The following examples illustrate embodiments 
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of the application of the instant methods as applied to idratifiying biomarkers associated with 
specific stages of bladder cancer and osteoarthritis, however, this aspect of the invention is 
not limited to these particular conditions. 

Bladder Cancer 

5 Affymetrix Chip Microarray Data Analysis of RNA expression profiles of Whole 

blood samples fix>m individuals having early or advanced bladder cancer as compared with 
KNA expression profiles fix>m normal individuals. 

This example demonstrates the use of the claimed invention to identify biomarkers in 
Whole blood samples which are specific to a stage of bladder cancer by comparing gene 
10 expression in blood from individuals with advanced bladder cancer and those without 
bladder cancer. 

As used herein, "early stage bladder cancer" includes bladder cancer wherein the 
detection of the anatomic extent of the tumor, both in its primary location and in metastatic 
sites, as defined by the TNM staging system in accordance with Harrison's Principles of 
15 Internal Medicine 14th edition can be considered early stage. More specifically, early stage 
bladder cancer can include those instances wherein the carcinoma is mainly superficial. 

As used herein, "advanced stage bladder cancel* is defined as bladder cancer wherein 
the detection of the anatomic extent of the tumor^ both in its primary location and in 
metastatic sites, as defined by the TNM staging system in accordance with Harrison's 
20 Principles of Internal Medicine 14th edition, can be considered as advanced stage. More 
specifically, advanced stage carcinomas can involve instances wherein the cancer has 
infiltrated the muscle and wherein metastasis has occurred. 

Whole blood samples were taken from patients who were diagnosed with early or 
advanced late stage bladder cancer as defined herein. RNA expression profiles were then 
25 analysed and compared to profiles &om patients unaffected by any disease. In each case, the 
diagnosis of early or advanced late stage bladder cancer was corroborated by a skilled Board 
certified physician . 

Total mRNA Scorn ablood sample taken &om each patient was isolated using TRIzol® 
reagent (GDBCO) and fluorescently labeled probes for each blood sample were generated as 
30 described above. Each probe was denatured and hybridized to a Affymetrix U133A Chip as 
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described herein. Identification of genes differentially expressed in Whole blood samples 
from patients with early or advanced late stage bladder cancer as compared to healthy 
patients was determined by statistical analysis using the Wilcox Mann Whitney rank sum test 
(Glantz SA. Primer of Biostatistics. 5th ed. New York, USA: McGraw-HiU Medical 
5 Publishing Division, 2002). 

Figure 16 shows a diagrammatic representation of RNA expression profiles of Whole blood 
samples from individuals who were identified as having advanced stage bladder cancer or 
early stage bladder cancer as described herein as compared with RNA expression profiles 
from non bladder cancer individuals. Expression profiles were generated usmg 
.0 GeneSpring™ software analysis as described herein. Each colunm represents the 

hybridization pattern resulting from a single individual. Non bladder cancer individuals 
, presented witiiout bladder cancer, but may have presented with other medical conditions and 
may be under various treatment regimes. Hybridizations to create said RNA expression 
profiles were done using the AfEymetrix U133A chip. A dendogram analysis is shown above. 
15 Samples are clustered and marked as representing patients who have early stage bladder 
cancer, advanced stage bladder cancer, or do not have bladder cancer. The indicates 
those patients who abnormally clustered despite actual presentation. The number of 
hybridizations profiles determined for either early stage bladder cancer, advanced bladder 
cancer or non-bladder cancer are shown. 3,518 genes were identified as being differentially 
20 expressed witti a p value of < 0.05 using an ANOVA analysis. The identity of the 

differentially expressed genes identified is shown in Table IT. Various experiments were 
also performed as outlined above, and analyzed using either the Wilcox Mann Whitney rank 
sum test or other statistical tests as described herein, and tiiose genes identified with a p value 
of < 0.05 as between the patients with any stage of advanced bladder cancer as compared 
25 with patients without bladder cancer are shown in Table 5V. 

Classification or class prediction of a test sample of an mdividual to determine 
whether said individual has advanced bladder cancer, early stage bladder cancer or does not 
have bladder cancer can be done using the differentially expressed genes as shown in Table 
30 IT and/or 5V as the predictor genes in combination with well known statistical algorithms as 
would be understood by a person skilled in the art and described herein. Commercially 
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available programs such as those provided by SiUcon Genetics (e.g. GeneSpring™) for Class 
Predication are also available. 



Osteoarthritis Staging 

This example demonstrates the use of the clahned invention to identify 
5 biomaricers in Whole blood samples which are specific various stages of osteoarthritis so as 
to allow the monitoring (progression or regression) of disease. 

Osteoarthritis (OA), as used herein also known as "degenerative joint disease", represents 
failure of a diarthrodial (movable, synovial-lined) joint. It is a condition, which affects joint 
cartilage, and or subsequently underlying bone and supporting tissues leading to pain, 
10 stiffiiess, movement problems and activity limitations. It most often affects the hip, knee, 
foot, and hand, but can affect other joints as weU. 

OA severity can be graded according to the system described by Marshall (Marshall 
KW. J Rheumatol, 1996:23(4) 582-85). Briefly, each of the six knee articular surfaces was 
assigned a cartilage grade with points based on the worst lesion seen on each particular 

15 surface. Grade 0 is normal (0 points). Grade I cartilage is soft or swollen but the articular 
surface is intact (1 point). In Grade n lesions, the cartilage surface is not intact but the lesion 
does not extend down to subchondral bone (2 points). Grade DI damage extends to 
subchondral bone but the bone is neither eroded nor ebumated (3 points). In Grade IV 
lesions, there is ebumation of or erosion into bone (4 points). A global OA score is 

20 calculated by summing the points from all six cartilage surfaces. If there is any associated 
pathology, such as meniscus tear, an extra point will be added to the global score. Based on 
the total score, each patient is then categorized into one of four OA groups: mild (1-6), 
moderate (7-12), marked (13-18), and severe (>18). As used herein, patients identified with 
OA may be categorized in any of the four OA groupings as described above. 

25 Whole blood samples were taken from patients who were diagnosed with 

osteoarthritis and a specific stage of osteoarthritis as defined herein. RNA expression profiles 
were then analysed and compared to profiles from patients unaffected by any disease. In 
each case, the diagnosis of osteoarthritis and the stage of osteoarthritis was corroborated by a 
skilled Board certified physician . 
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Total mRNA ftom a blood sample taken from each patient was isolated using 
TRizol® reagent (GIBCO) and fluorescently labeled probes for each blood sample were 
generated as described above. Each probe was denatured and hybridized to a 15K 
Chondrogene Microarray Chip (ChondroChip™) as described herein. Identification of genes 
5 differentially expressed in Whole blood samples from patients with disease as compared to 
healthy patients was determined by statistical analysis using the Wilcox Mann Whitney rank 
sum test (Glantz SA. Primer of Biostatistics., 5th ed. New York, USA: McGraw-Hill Medical 
Publishing Division, 2002). 

Figure 20 shows a diagrammatic representation of RNA expression profiles of Whole 
10 blood samples from individuals having osteoarthritis as compared with RNA expression 
profiles from normal individuals. Expression profiles were generated using GeneSpring™ 
software analysis as described herein. Each column represents the hybridization pattem 
resulting from a single individual. Normal individuals have no known medical conditions 
and were not taking any known medication. Hybridizations to create said RNA expression 
15 profiles were done using the ChondroChip™ and the Affymetrix™ Chip. A dendogram 
analysis is shown above. Samples are clustered and marked as representing patients who 
presented with different stages of osteoarthritis or normal. The indicates Aose patients 
who abnormally clustered despite actual presentation. The number of hybridizations profiles 
detamined for either osteoarthritis patients or normal individuals are shown. Differentially 
20 expressed genes were identified as being differentially expressed using ANOVA analysis and 
fliose genes witii a p value of < 0.05 identified. The identity of the differentially expressed 
genes is shown in Tables 1 Y. In addition, various experiments were also performed as 
outlined above, and analyzed using either the WUcox Marni Whitney rank sum test or other 
statistical tests as described herein, and using a pairwise comparison, those genes identified 
25 with a p value of < 0.05 as between the patients with any stage of osteoarthritis as compared 
with patients without osteoarthritis are shown in Table 4A and 4B. 

Classification or class prediction of a test sample from an unknown patient in order to 
diagnose said individual with osteoarthritis can be done using tiie differentially expressed 
genes as shown in Table 4A and 4B in combination with well known statistical algorithms for 
30 class prediction as would be understood by a person skilled in the art and is described herein. 
Commercially available programs such as those provided by Silicon Graietics (e.g. 
GeneSpring™) for Class Prediction are also available. 
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DiflFerentially expressed genes were also identified as being differentially expressed 
with a p value of < 0.05 as between patients with mild osteoarthritis and normal individuals. 
The identity of the differentially expressed genes is shown in Tables 4C and 4D. 
Differentially expressed genes were also identified as being differentially expressed with a p 

5 value of < 0.05 as between patients with moderate osteoarthritis and normal mdividuals. 
Classification or class prediction of a test sample of an individual to determine whether said 
individual has mild osteoarthritis can be done using the differentially expressed genes as 
shown in Table 4C and/or 4D as the predictor genes in combmation with well known 
statistical algorithms as would be understood by a person skilled in the art and described 

ID herein. Commercially available programs such as those provided by Silicon Genetics (e.g. 
GeneSpring^^ for Class Predication are also 

Differentially expressed genes were also identified as being differentially expressed 
with a p value of < 0.05 as between patients with moderate osteoarthritis and normal 

15 individuals. The identity of the differentially expressed genes is shown in Tables 4E and 4F. 
Classification or class prediction of a test sample of an individual to determine whether said 
individual has moderate osteoarthritis can be done using the differentially expressed genes as 
shown in Table 4E and/or 4F as the predictor genes in combination with well known 
statistical algorithms as would be understood by a person skiUed in the art and described 

20 herein. Commercially available programs such as those provided by Silicon Genetics (e.g. 
GeneSpring™) for Class Prediction are also 

Differentially expressed genes were also identified as being differentially expressed 
with a p value of < 0.05 as between patients with marked osteoarthritis and normal 
mdividuals. The identity of the differentially expressed genes is shown in Tables 4G and 4H. 

25 Classification or class prediction of a test sample of an individual to determine 

whether said individual has marked osteoarthritis can be done using the differentially 
expressed genes as shown in Table 4G and/or 4H as the predictor genes in combination with 
well known statistical algorithms as would be understood by a person skilled in the art and 
described herein. Commercially available programs such as those provided by SiUcon 

30 Genetics (e.g. GeneSpring™) for Class Prediction are also available. 
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Differratially expressed genes were also identified as being differentially expressed 
with a p value of < 0.05 as between patients with severe osteoarthritis and patients without 
osteoarthritis. The identity of the differentially expressed genes is shown in Tables 41 and 4J, 

Classification or class prediction of a test sample of an individual to differentiate as to 
5 whether said individual has severe osteoarthritis can be done using the differentially 

expressed genes as shown in Table 41 and/or 4 J as the predictor genes in combination with 

well known statistical algorithms as would be understood by a person skilled in the art and 

described herein. Commercially available programs such as those provided by Silicon 

Genetics (e.g. GeneSpring™) for Class Prediction are also available. ] 

i 

10 

Differentially expressed genes were also identified as being differentially expressed 
with a p value of < 0.05 as between patients with mild osteoarthritis and patients with 
moderate osteoarttiritis. The identity of the differentially expressed genes is shown in Tables 
4K and 4L. Classification or class prediction of a test sample of an individual to differentiate 
15 as to whether said individual has mild or moderate osteoarthritis can be done using the 
differentially expressed genes as shown in Table 4K and/or 4L as the predictor genes m 
combination with well known statistical algorithms as would be understood by a person 
skilled in the art and described herein. Commercially available programs such as those 
provided by Silicon Genetics (e.g. GeneSpring™) for Class Prediction are also available. 

20 

Differentially expressed genes were also identified as being differentially expressed 
with a p value of < 0.05 as between patients with mild osteoarthritis and patients with marked 
osteoarthritis. The identity of the differentially expressed genes is shown in Tables 4M and 
4N. 

25 Classification or class prediction of a test sample of an individual to differentiate as to 

whether said individual has mild or marked osteoarthritis can be done using the differentially 
expressed genes as shown in Table 4M and/or 4N as the predictor genes in combination with 
well known statistical algorithms as would be understood by a person skilled in the art and 
described herein. Commercially available programs such as those provided by Silicon 

30 Genetics (e.g. GeneSpring™) for Class Prediction are also available. 
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Differentially expressed genes were also identified as being differentially expressed 
with a p value of < 0.05 as between patients with mild osteoarthritis and patients with 
severeosteoarthritis. The identity of the differentially expressed genes is shown in Tables.40 
5 and4P. 

Classification or class prediction of a test sample of an individual to differentiate as to 
whether said individual has mild or severe osteoarthritis can be done using the differentially 
expressed genes as shown in Table 40 and/or 4P as the predictor genes m combination with 
well known statistical algorithms as would be understood by a person skilled in the art and 
10 described herein. Commercially available programs such as those provided by Silicon 
Genetics (e.g. GeneSpring™) for Class Prediction are also available. 

Differentially expressed genes were also identified as being differentially expressed 
with a p value of < 0.05 as between patients with moderate osteoarthritis and patients with 
15 marked osteoarthritis. The identity of the differentially expressed genes is shown in Tables 
4Qand4R. 

Classification or class prediction of a test sample of an individual to differentiate as to 
whether said individual has moderate or marked osteoarthritis can be done using the 
differentially expressed genes as shown in Table 4Q and/or 4R as the predictor genes in 
20 combination witii well known statistical algorithms as would be understood by a person 
skilled in the art and described herein. Commercially available programs such as those 
provided by Silicon Genetics (e.g. GeneSpring^M) for Class Prediction are also available. 

Differentially expressed genes were also identified as being differentially expressed 
with a p value of < 0.05 as between patients with moderate osteoarthritis and patients with 
25 severe osteoarthritis. The identity of the differentially expressed genes is shown in Tables 48 
and4T. 

Classification or class prediction of a test sample of an individual to differentiate as to 
whether said individual has moderate or severe osteoarthritis can be done using the 
differentially expressed genes as shown m Table 4S and/or 4T as the predictor genes in 
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combination with well known statistical algorithms as would be understood by a person 
skilled in the art and described herein. Commercially available programs such as those 
provided by Silicon Genetics (e.g. GeneSpring™) for Class Prediction are also available. 



5 Differentially expressed genes were also identified as being differentially expressed 

with a p value of < 0.05 as betwera patients with maiked osteoarthritis and patients with 
severe osteoarthritis. The identity of the differentially expressed genes is shown in Tables 4U 
and4V. 

Classification or class prediction of a test sample of an individual to differentiate as to 
1 0 whether said individual has marked or severe osteoarthritis can be done using the 

differentially expressed genes as shown in Table 4U and/or 4V as the predictor genes in 
combination with well known statistical algorithms as would be understood by a person 
skilled in the art and described herein. Commercially available programs such as those 
provided by Silicon Genetics (e.g. GeneSpring™) for Class Prediction are also available. 

15 EXAMPLE 5 

In addition to methods to identify biomarkers associated with a specific disease or 
condition, or stage thereof, this invention also includes methods to identify biomarkers that 
distinguish between two conditions. The pair of conditions can be closely related, can have 
unrelated etiology but display similar overt symptoms, or can be unrelated. The following 
20 examples illustrate embodiments of methods of this aspect of the invention, but the invention 
is not limited to these embodiments. 

Manic Depression Syndrome as Compared with Schizophrenia RNA expression 
profiles 

25 . This example demonstrates the use of the claimed invention to identify biomarker 

which are capable of differentiating between manic depression syndrome and schizophrenia 
and use of same. 
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Whole blood samples were taken from patients diagnosed with MDS and 
Whole blood samples were taken fix)m patients diagnosed with schizophrenia as defined 
herein. RNA expression profiles were then analyzed and the profiles generated for 
individuals having MDS compared with the profiles generated for individuals having 
S schizophrenia. In each case, the diagnosis of MDS and schizophrenia is corroborated by a 
skilled Board certified physician. RNA expression profiles of Whole blood samples bom 
individuals who were identified as having MDS as described herein as compared with KNA 
expression profiles fix)m individuals identified as having schizophrenia were generated using 
GeneSpring™ software analysis as described herem. Hybridizations to create said RNA 

10 expression profiles were done usmg the Afifymetrix® GeneChip® platforms (U133A and 
U133 Plus 2.0) platforms (U133A and/or U133 Plus 2.0) as described herein (data not 
shown). Various experiments were performed as outlined above, and analyzed using either 
the Wilcox Mann Whitney rank simi test or other statistical tests as described herein, and 
those genes identified with a p value of < 0.05 as between the patients with MDS as 

15 compared with patients schizophrenia are shown in Table 3A. 

Classification or class prediction of a test sample of an individual to dijfferentiate as to 
whether said individual has schizophrenia or MDS can be done using the differentially 
expressed genes as shown in Table 3 A as the predictor genes in combination with well 
known statistical algorithms as would be understood by a person skilled in the art and 
20 described herein. Commercially available programs such as those provided by Silicon 
Genetics (e.g. GeneSpring™) for Class Prediction are also available. 



Hepatitis as compared with Liver Cancer RNA expression profiles 

25 This example demonstrates the use of the claimed invention to identify biomarker 

which are capable of differentiating between hepatitis B and liver cancer and use of same. 

Whole blood samples were taken from patients diagnosed with hepatitis B and Whole 
blood samples were taken from patients diagnosed with liver cancer as defined herein. RNA 
expression profiles from were then analyzed and the profiles generated for uidividuals having 
30 hepatitis B compared with the profiles generated for individuals having liver cancer. In each 
case, the diagnosis of hepatitis B or liver cancer is corroborated by a skilled Board certified 
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physician. RNA rapression profiles of Whole blood samples from individuals who were 
identified as having hepatitis B as described herein as compared with RNA expression 
profiles &om individuals identified as having schizophrenia were generated using 
GeneSpring™ software analysis as described herein. Hybridizations to create said RNA 
expression profiles were done using the Affymetrix® GeneChip® platforms (U133A and 
U133 Plus 2.0) platforms (U133A and/orU133 Plus 2.0) as described herein (data not 
shown). Various experiments were performed as outlined above, and analyzed usmg either 
the Wilcox Mann Whitney rank sum test or other statistical tests as described herein, and 
those genes identified with a p value of < 0.05 as between the patients with MDS as 
compared with patients schizophrenia are shown in Table 3B. 

Classification or class prediction of a test sample of an individual to differentiate as to 
whether said individual has hepatitis or liver cancer can be done using the differentially 
expressed genes as shown in Table 3B as the predictor genes in combination with well known 
statistical algorithms as would be understood by a person skilled in the art and described 
herein. Commercially available programs such as those provided by SiUcon Genetics (e.g. 
GeneSpring™) for Class Prediction are also available. 



RNA expression profiles 

Bladder Cancer as Compared with Kidney Cancer RNA expression profiles 

This example demonstrates the use of the claimed invention to identify 
biomarker which are capable of differentiating between bladder cancer and kidney cancer and 
use of same. 

Whole blood samples were taken from patients diagnosed with bladder cancer 
and Whole blood samples were taken &om patients diagnosed with kidney canceras defined 
herein. RNA expression profiles were then analyzed and the profiles generated. In each 
case, the diagnosis of bladder cancer and kidney cancer was coiroborated by a skilled Board 
certified physician. RNA expression profiles of Whole blood samples from individuals who 
were identified as having bladder cancer as described herein as compared with RNA 
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expression profiles fi-om individuals identified as having kidney cancer were generated using 
GeneSpring™ software analysis as described herein. Hybridizations to create said RNA 
expression profiles were done using the Affymetrix® GeneChip® platforms (U133A and 
U133 Plus 2.0) platforms (U133A and/or U133 Plus 2.0) as described herein (data not 
5 shown). Various experiments were performed as outlined above, and analyzed using either 
the Wilcox Mann Whitney rank sum test or other statistical tests as described herein, and 
those genes identified with a p value of < 0.05 as between the patients with bladder cancer as 
compared with patients with kidney cancer are shown in Table 3C. 

Classification or class prediction of a test sample of an individual to differentiate as to 
10 whether said individual has bladder cancer or kidney cancer can be done using the 

differentially expressed genes as shown in Table 3C as the predictor genes in combination 
with well known statistical algorithms as would be understood by a person skilled in the art 
and described herein. Commercially available programs such as those provided by Silicon 
Genetics (e.g. GeneSpring™) for Class Prediction are also available. 

15 RNA expression profiles 

Bladder Cancer as compared with Testicular CancerRNA Expression profiles 
This example demonstrates the use of the claimed invention to identify biomarker 
which are capable of differratiating between bladder cancer and testicular cancer and use of 
20 same. 

Whole blood samples were taken fi:om patients diagnosed with bladder cancer 
and Whole blood samples were taken fi:om patients diagnosed with testicular cancer as 
defined herein. RNA expression profiles were then analyzed and the profiles, generated for 
individuals having bladder cancer as compared with the profiles generated for individuals 

25 having testicular cancer. In each case, the diagnosis of bladder cancer and testicular cancer is 
corroborated by a skilled Board certified physician. RNA expression profiles of Whole 
blood samples from individuals who were identified as having bladder cancer as described 
herein as compared with RNA expression profiles from individuals identified as having 
testicular cancer were generated using GeneSpring™ software analysis as described herein. 

30 Hybridi2ations to create said RNA expression profiles were done using the Affymetrix® 
GeneChip® platforms (U133A and U133 Plus 2.0) platforms (U133A and/or U133 Plus 2.0) 
as described herein (data not shown). Various experiments were performed as outlined 
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above, and analyzed using either the Wilcox Mann Whitney rank sum test or other statistical 
tests as described herein, and those genes identilfied with a p value of < 0.05 as between the 
patients with bladder cancer as compared with patients testicular cancer are shown in Table 
3D. 

5 Classification or class prediction of a test sample of an individual to difierentiate as to 

whether said individual has bladder cancer or testicular cancer can be done using the 
differentially expressed genes as shown in Table 3D as the predictor genes in combination 
with well known statistical algorithms as would be understood by a person skilled in the art 
and described herein. Commercially available programs such as those provided by Silicon 

10 Genetics (e.g. GeneSpring™) for Class Prediction are also available. 



RNA expression profilesKidney Cancer as compared with Testicular CancerRNA 

Expression Profiles This example demonstrates the use of the claimed invention to identify 
15 biomarker which are capable of differentiating between kidney cancer and testicular cancer 
and use of same. 

Whole blood samples were taken from patients diagnosed with kidney cancer 
and Whole blood samples wore taken from patients diagnosed with testicular cancer as 
defined herein. RNA expression profiles were then analyzed and the profiles generated for 

20 individuals having kidney cancer as compared with the profiles generated for individuals 
having testicular cancer. In each case, fiie diagnosis of kidney cancer and testicular canco: is 
corroborated by a skilled Board certified physician. RNA expression profiles of Whole 
blood samples from individuals who were identified as having kidney cancer as described 
herein as compared with RNA expression profiles from individuals identified as having 

25 testicular cancer were generated usmg GeneSpring™ software analysis as described herein. 
Hybridizations to create said RNA expression profiles were done using the Affymetrix® 
GeneChip® platforms (U133A and U133 Plus 2.0) platforms (U133A and/or U133 Plus 2.0) 
as described herein (data not shown). Various experiments were performed as outlined 
above, and analyzed using either the Wilcox Maim Whitney rank sum test or other statistical 

30 tests as described herein, and those genes identified with a p value of < 0.05 as between the 
patients with bladder cancer as compared with patients with testicular cancer are shown in 
Table 3E. 
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Classification or class prediction of a test sample of an individual to differentiate as to 
whether said individual has bladder cancer or testicular cancer can be done using the 
differentially expressed genes as shown in Table 3E as the predictor genes in combination 
with well known statistical algorithms as would be understood by a p^on skilled in the art 
5 and described herein. Commercially available programs such as those provided by SiUcon 
Genetics (e.g. GeneSpring™) for Class Prediction are also available. 

RNA expression profiles 

10 Liver Cancer as compared ^ith Stomach Cancer RNA expression profiles 

This example demonstrates the use of the claimed invention to identify biomarker 
which are capable of differentiating between Uver cancer and stomach cancer and use of 
same. 

Whole blood samples were taken firom patients diagnosed with liver cancer 
1 5 and Whole blood samples were taken from patients diagnosed with stomach cancer as 

defined herein. RNA expression profiles were then analyzed and the profiles generated for 
individuals having liver cancer as compared with the profiles generated for individuals 
having stomach cancer. In each case, the diagnosis of liver cancer and stomach cancer is 
corroborated by a skilled Board certified physician. RNA expression profiles of Whole 
20 blood samples from individuals who were identified as having liver cancer as described 
herein as compared with RNA expression profiles from individuals identified as having 
stomach cancer were generated using GeneSpring™ software analysis as described herein. 
Hybridizations to create said RNA expression profiles were done using the Affymetrix® 
GeneChip® platforms (U133A and/or U133 Plus 2.0) as described herein (data not shown). 
25 Various experiments were performed as outlined above, and analyzed using either the Wilcox 
Mann Whitney rank sum test or other statistical tests as described herein, and those genes 
identified with a p value of < 0.05 as between the patients witti bladder cancer as compared 
with patients testicular cancer are shown in Table 3F. 

Classification or class prediction of a test sample of an individual to differentiate as to 
30 whether said individual has bladder cancer or testicular cancer can be done using the 

differentially expressed genes as shown in Table 3F as the predictor genes in combination 
with well known statistical algorithms as would be understood by a person skilled in the art 
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and described herein. Commercially available programs such as those provided by Silicon 
Genetics (fe.g. GeneSpring™) for Class Prediction are also available. 



RNA expression profiles 

5 Liver Cancer as compared with Colon Cancer 

This example demonstrates the use of the claimed invention to identify 
biomarker which are capable of differentiating between liver cancer and colon cancer and use 
of same. 

Whole blood samples were taken fiom patients diagnosed with liver cancer 
10 and Whole blood samples were taken from patients diagnosed with colon cancer as defined 
herein. RNA expression profiles were then analyzed and the profiles generated for 
individuals having liver cancer as compared with the profiles generated for individuals 
having colon cancer. In each case, the diagnosis of hver cancer and colon cancer is 
corroborated by a skilled Board certified physician. RNA expression profiles of Whole 
15 blood samples from individuals who were identified as having liver cancer as described 

herein as compared with RNA expression profiles from individuals identified as having colon 
cancer were generated using GeneSpring™ software analysis as described herein. 
Hybridizations to create said RNA expression profiles were done using the Affymetrix® 
GeneChip® platforms (U133A and/or U133 Plus 2.0) as described herein (data not shown). 
20 Various experiments were performed as outlined above, and analyzed using either the Wilcox 
Mann Whitney rank sum test or other statistical tests as described herein, and those genes 
identified with a p value of < 0.05 as between the patients with Uver cancer as compared with 
patients with colon cancer are shown in Table 3G. 

Classification or class prediction of a test sample of an individual to differentiate as to 
25 whether said individual has liver cancer or colon cancer can be done using the differentially 
expressed genes as shown in Table 3G as the predictor genes in combination with well 
known statistical algorithms as would be understood by a person skilled in the art and 
described herein. Commercially available programs such as those provided by Silicon 
Genetics (e.g. GeneSpring™) for Class Prediction are also available. 

30 
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Stomach Cancer as compared with CoIod Cancer 
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This example demonstrates the use of the claimed invention to identify 
biomarkers which are capable of differentiating between stomach cancer and colon cancer 
5 and use of same. 

Whole blood samples were taken from patients diagnosed with stomach cancer 
and Whole blood samples were taken from patients diagnosed with colon cancer as defined 
herein. KNA expression profiles were then analyzed and the profiles generated for 
individuals having stomach cancer as compared with the profiles generated for individuals 

10 having colon cancer. In each case, the diagnosis of stomach cancer and colon cancer is 
corroborated by a skilled Board citified physician. KNA expression profiles of Whole 
blood samples from individuals who were identified as having stomach cancer as described 
herein as compared with RNA expression profiles from individuals identified as having colon 
cancer were generated using GeneSpring™ software analysis as described herein. 

15 Hybridizations to create said RNA expression profiles were done using the Affymetrix® 
GeneChip® platforms (U133A and/or U133 Plus 2.0) as described herein (data not shown). 
Various experiments were performed as outlined above, and analyzed using either the Wilcox 
Mann Whitney rank sum test or other statistical tests as described herein, and those genes 
identified with a p value of < 0.05 as between the patients with stomach cancer as compared 

20 with patients colon cancer are shown in Table 3H. 

Classification or class prediction of a test sample of an individual to differentiate as to 
whether said individual has stomach cancer or colon cancer can be done using the 
differentially expressed genes as shown in Table 3H as the predictor genes in combination 
with well known statistical algorithms as would be understood by a person skilled in the art 
25 and described herein. Commercially available programs such as those provided by Silicon 
Genetics (e.g. GeneSpring*^^ for Class Prediction are also available. 



Osteoarthritis as compared with Rheumatoid Arthritis 

30 This example demonstrates the use of the claimed invention to identify 

biomarkers which are capable of differentiating between OA and RA and use of same. 
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Whole blood samples were taken from patients diagnosed with OA and Whole 
blood samples were taken from patients diagnosed with RA as defined herein. RNA 
expression profiles were then analyzed and the profiles generated for individuals having OA 
as compared with the profiles generated for individuals having RA. In each case, the 

5 diagnosis of OA and RA is corroborated by a skilled Board certified physician. RNA 
expression profiles of Whole blood samples fix>m individuals who were identified as having 
OA as described herein as compared with RNA expression profiles from individuals 
identified as havmg RA were generated usmg GeneSpring™ software analysis as described 
herein. Hybridizations to create said RNA expression profiles were done using the 

10 Affymetrix® GeneChip® platforms (U133A and/or U133 Plus 2.0) as described herein (data 
not shown). Various experiments were performed as outlined above, and analyzed using 
either the Wilcox Mann Whitney rank sum test or other statistical tests as described herein, 
and those genes identified with a p value of < 0.05 as between the patients with OA as 
compared with patients with RA are shown in Table 3L 

15 Classification or class prediction of a test sample of an individual to differentiate as to 

whether said individual has OA or RA can be done using the differentially expressed genes as 
shown in Table 31 as the predictor genes in combination with well known statistical 
algorithms as would be understood by a person skilled in the art and described herein. 
Commercially available programs such as those provided by Silicon Genetics (e.g. 

20 GeneSpring™) for Class Prediction are also available. 

Chagas Disease as compared with Heart Failure This example demonstrates the 
use of the claimed invention to identify biomarkers which are capable of differentiating 
between Chagas' disease and heart failure and use of same. 

25 Whole blood samples were taken firom patients diagnosed with Chagas' 

disease and Whole blood samples were taken firom patients diagnosed with heart failure as 
defined herein. RNA expression profiles were then analyzed and the profiles generated for 
individuals having Chagas' disease as compared with the profiles generated for individuals 
having heart failure. In each case, the diagnosis of Chagas' disease and heart failure is 

30 corroborated by a skilled Board certified physician. RNA expression profiles of Whole 
blood samples fi-om individuals who were identified as having Chagas' disease as described 
herein as compared with RNA expression profiles from individuals identified as having heart 
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failure were generated using GeneSpring™ software analysis as described herein. 
Hybridizations to create said RNA expression profiles were done using the Affymetrix® 
GeneChip® platforms (U133A and/or U133 Plus 2.0) as described herein (data not shown). 
Various experiments were performed as outlined above, and analyzed using either the Wilcox 
5 Mann Whitney rank sum test or other statistical tests as described herein, and those genes 
identified with a p value of < 0.05 as between the patients with Chagas' disease as compared 
with patients with heart failure are shown in Table 31. 

Classification or class prediction of a test sample of an individual to differentiate as to 
whether said individual has Chagas' disease or heart failure can be done using the 
10 differentially expressed genes as shown in Table 31 as the predictor genes in combination 
with well known statistical algorithms as would be understood by a person skilled in the art 
and described herein. Commercially available programs such as those provided by SiUcon 
Genetics (e.g. GeneSpring™) for Class Prediction are also available. 

RNA expression profiles 

15 Chagas Disease as compared with Coronary Artery Disease This example 

demonstrates the use of the claimed invention to identify biomarkers which are capable of 
differentiating between Chagas' disease and coronary artery disease and use of same. 

Whole blood samples were taken from patients diagnosed with Chagas* 
disease and Whole blood samples were taken from patients diagnosed with coronary artery 

20 disease as defined herein. RNA expression profiles were then analyzed and the profiles 
generated for mdividuals having stomach cancer as compared with the profiles generated for 
mdividuals havhig coronary artery disease. In each case, the diagnosis of Chagas' disease 
and coronary artery disease is corroborated by a skilled Board certified physician. RNA 
expression profiles of Whole blood samples firom mdividuals who were identified as having 

25 Chagas' disease as described herein as compared with RNA expression profiles from 

mdividuals identified as having coronary artery disease were generated using GeneSpring™ 
software analysis as described herein. Hybridizations to create said RNA expression profiles 
were done using the Affymetrix® GeneChip® platforms (U133A and/or U133 Plus 2.0) as 
described herein (data not shown). Various experiments were performed as outhned above, 

30 and analyzed using either the Wilcox Mann Whitney rank sum test or other statistical tests as 
described herein, and those genes identified with a p value of < 0.05 as between the patients 
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with Chagas' disease as compared with patients coronary artery disease are shown in Table 
3L. 

Classification or class prediction of a test sample of an individual to differentiate as to 
whether said individual has Chagas' disease or coronary artery disease can be done using the 
5 differentially expressed genes as shown in Table 3L as the predictor genes in combination 
with well known statistical algorithms as would be understood by a person skilled in the art 
and described herein. Commercially available programs such as those provided by Silicon 
Genetics (e.g. GeneSpring™) for Class Prediction are also available. 

RNA expression profiles 

10 

RNA expression profilesRNA expression profilesRNA expression profilesRNA 
expression profiles. 

Coronary Artery Disease (CAD) as compared with Heart FaUure 

This example demonstrates the use of the claimed invention to identify biomarkers 
1 5 which are capable of differentiating between Coronary Artery Disease (CAD) and Heart 
Failure and use of same. 

Whole blood samples were taken fcom patients diagnosed with having Coronary 
Artery Disease (CAD) and Whole blood samples were taken from patients diagnosed with 
having Heart Failure as defined herein. RNA expression profiles were then analyzed and the 

20 profiles generated for individuals having CAD as compared with the profiles generated for 
individuals heart failure. Li each case, the diagnosis of heart failure and coronary artery 
disease is corroborated by a skilled Board certified physician. RNA expression profiles of 
Whole blood samples firom individuals who were identified as having coronary artery disease 
as described herein as compared with RNA expression profiles fi'om individuals identified as 

25 having heart failure were generated using GeneSpring™ software analysis as described 
herein. Hybridizations to create said RNA expression profiles were done using the 
Affymetrix® GeneChip® platforms (U133A and/or U133 Plus 2.0) as described herein (data 
not shown). Various experiments were performed as outlined above, and analyzed using 
either the Wilcox Mann Whitney rank sum test or other statistical tests as described herein, 

30 and those genes identified with a p value of < 0.05 as between the patients with coronary 
artery disease as compared with patients with heart failure are shown in Table 3N. 
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Classification or class prediction of a test sample of an individual to differentiate as to 
whether said individual has coronary artery disease or heart failure can be done using the 
differentially expressed genes as shown in Table 3N as the predictor genes in combination 
with well known statistical algorithms as would be understood by a person skilled in the art 
5 and described herein. Commercially available programs such as those provided by Silicon 
Genetics (e.g. GeneSpring™) for Class Prediction are also available. 

RNA expression profiles 
Asymptomatic Chagas Disease as compared with Symptomatic Chagas Disease 

This example demonstrates the use of the claimed invention to identify biomarkers 
10 which are capable of differentiating between Asymptomatic Chagas Disease and 
Symptomatic Chagas Disease and use of same. 

Whole blood samples were taken from patients diagnosed with having Asymptomatic 
Chagas Disease and Whole blood samples were taken from patients diagnosed with 
Symptomatic Chagas Disease as defined herein. RNA expression profiles were then 

15 analyzed and the profiles generated for individuals having Asymptomatic Chagas Disease as 
compared with the profiles generated for individuals with Symptomatic Chagas Disease. In 
each case, the diagnosis of Asymptomatic Chagas Disease and and Symptomatic Chagas 
Disease is corroborated by a skilled Board certified physician. RNA expression profiles of 
Whole blood samples from individuals who were identified as having Asymptomatic Chagas 

20 Disease as described herein as compared with RNA expression profiles from individuals 
identified as having Symptomatic Chagas Disease were generated using GeneSpring™ 
software analysis as described herein. Hybridizations to create said RNA expression profiles 
were done using the Affymetrix® GeneChip® platforms (Ul 33 A and/or Ul 33 Plus 2.0) as 
described herein (data not shown). Various experiments were performed as outlined above, 

25 and analyzed using either the Wilcox Mann Whitney rank sum test or other statistical tests as 
described herein, and those genes identified with a p value of < 0.05 as between the patients 
with Asymptomatic Chagas Disease as compared with patients with Symptomatic Chagas 
Disease are shown in Table 3P. 

Classification or class prediction of a test sample of an individual to differentiate as to 
30 whether said individual has Asymptomatic Chagas Disease or Symptomatic Chagas Disease 
can be done using the differentially expressed genes as shown in Table 3P as the predictor 
genes in combination with well known statistical algoritiuns as would be imderstood by a 
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person skiUed in the art and described herein. Commercially available programs such as 
those provided by Silicon Genetics (e.g. GeneSpring™) for Class Prediction are also 
available. 



Alasheimer's Disease as compared with Schizophrenia 

This example demonstrates the use of the claimed invention to identify biomarkers 
which are capable of differentiating between Alzheimer's Disease and Schizophrenia and use 
of same. 

10 Whole blood samples were taken from patients diagnosed with having Alzheimer's 

Disease Disease and Whole blood samples were taken from patients diagnosed with 
Schizophrenia as defmed herein. KNA expression profiles were then analyzed and the 
profiles generated for individuals havmg Alzhehner's Disease as compared with the profiles 
generated for individuals with Schizophrenia . hi each case, the diagnosis of Alzheimer's 

15 Disease and Schizophrenia is corroborated by a skilled Board certified physician. RNA 
expression profiles of Whole blood samples from individuals who were identified as havuag 
Alzheimer's Disease as described herein as compared with KNA expression profiles from 
individuals identified as Schizophrenia were generated using GeneSprmg™ software analysis 
as described herem. Hybridizations to create said RNA expression profiles were done usmg 

20 the Afiymetrix® GeneChip® platforms (U133A and/or U133 Plus 2.0) as described herein 
(data not shown). Various experiments were performed as outlined above, and analyzed 
using either the Wilcox Mann Whitney rank sum test or other statistical tests as described 
herein, and those genes identified with a p value of < 0.05 as between the patients with 
Alzheimer's Disease as compared with patients Schizophrenia are shown in Table 3Q. 

25 Classification or class prediction of a test sample of an individual to diflferentiate as to 

whether said individual has Alzheimer's Disease or Schizophrenia can be done using the 
differentially expressed genes as shown in Table 3Q as the predictor genes in combination 
with well known statistical algorithms as would be understood by a person skilled in the art 
and described herein. Commercially available programs such as those provided by Silicon 

30 Genetics (e.g. GeneSpring™) for Class Prediction are also available. 
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Alzheimer's Disease as compared with Manic Depression 

This example demonstrates the use of the claimed invention to identify biomarkers 
5 which are capable of differentiating between Alzheimer's Disease and Manic Depression and 
use of same. 

Whole blood samples were taken from patients diagnosed with having Alzheimer's 
Disease Disease and Whole blood samples were taken from patients diagnosed with Manic 
Depression as defined herem. KNA expression profiles were then analyzed and the profiles 

10 generated for individuals having Alzheimer's Disease as compared with tibie profiles 

generated for individuals with Manic Depression. In each case, the diagnosis of Alzheimer's 
Disease and Manic Depression is corroborated by a skilled Board certified physician. RNA 
expression profiles of Whole blood samples &om individuals who were identified as having 
-Alzheimer's Disease as described herein as compared with RNA expression profiles Scorn 

15 individuals identified as Manic Depression were generated using GeneSpring™ software 
analysis as described herein. Hybridizations to create said RNA expression profiles were 
done using the Affymetrix® GeneChip® platforms (U133A and/or U133 Plus 2.0) as 
described herein (data not shown). Various experiments were performed as outlined above, 
and analyzed using either the Wilcox Mann Whitney rank sum test or other statistical tests as 

20 described herein, and those genes identified with a p value of < 0.05 as between the patients 
with Alzheuner's Disease as compared with patients Manic Depression are shown in Table 
3R. 

Classification or class prediction of a test sample of an individual to differentiate as to 
whether said individual has Alzheimer's Disease or Manic Depression can be done using die 
25 differentially expressed genes as shown in Table 3R as the predictor genes in combination 
with well known statistical algorithms as would be understood by a person skilled in the art 
and described herein. Commercially available programs such as those provided by Sihcon 
Genetics (e.g. GeneSpring™) for Class Prediction are also available. 

30 RNA expression profiles 
EXAMPLE 5 
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In addition to methods to identify markers that distinguish between two diseases or 
conditions, this invention also includes methods to identify biomarkers specific for a group of 
three or more related diseases or conditions. The following three examples present methods 
to identify biomarkers for the following groups of diseases or conditions: cancer, 
5 cardiovascular disease and neurological disease, and the identified markers thereof. However 
the invention is not limited to these three groups of diseases or conditions. 



Cancer 

This example demonstrates the use of tiie claimed invention to identify biomarkers of 
10 cancer and use.of same. 

As xised herein "Cancer" is defined as any of the various types of maUgnant 
neoplasms, most of which invade surrounding tissues, may metastasize to several sites, and 
are likely to recur after attempted removal and to cause death of the patient unless adequately 
treated; especially, any such carcinoma or sarcoma, but, in ordinary usage, especially the 

1 5 former. In each case, the diagnosis of Cancer was corroborated by a skilled Board certified 
physician. RNA expression profilesRNA expression profiles of Whole blood samples firom 
individuals who were identified as having cancer as described herein as compared with RNA 
expression profiles fi:om individuals not having cancer, were generated using GeneSpring™ 
software analysis as described herein. Hybridizations to create said RNA expression profiles 

20 were done using the Afifymetrix® GeneChip® platforms (U133A and U133 Plus 2.0) 
platforms as described herein (data not shown). Various experiments were performed as 
outlined above, and analyzed usmg either the Wilcox Mann Whitney rank sum test or other 
statistical tests as described herein, and those genes identified with a p value of < 0.05 as 
between the patients with cancer as compared with patients without cancer are shown in 

25 Table 6A. 

Classification or class prediction of a test sample from an unknown patient in order to 
diagnose said individual with cancer can be done using the differentially expressed genes as 
shown in Table 6A in combination with well known statistical algorithms for class prediction 
as would be understood by a person skilled in the art and is described harein. Commercially 
30 available programs such as those provided by Silicon Genetics (e.g. GeneSpring^M) for Class 
Prediction are also available. 
Cardiovascular Disease 



142 



wo 2004/112589 PCTAJS2004/020836 

This example demonstrates tiie use of the claimed invention to identify biomarkers of 
caidiovascular disease and use of same. 

As used herein in tfiis example "Cardiovascular Disease" is defined as a disease 
affecting the heart or blood vessels. Cardiovascular diseases inchide coronary artery disease, 

5 hearart failure, and hypertension. In each case, the diagnosis of Cardiovascular Disease was 
corroborated by a skilled Board certified physician. RNA expression profiles of Whole blood 
samples from individuals who were identified as having Cardiovascular Disease as described 
herein as compared with RNA expression profiles from individuals not having 
Cardiovascular Disease, were generated using GeneSpring™ software analysis as described 

10 herein. Hybridizations to create said RNA expression profiles were done using the 

Affymetrix® GeneChip® platfonns (U133A and U133 Plus 2.0) platforms as described 
herein (data not shown). Various experiments were performed as outlined above, and 
analyzed using either the Wilcox Mann Whitney rank sum test or other statistical tests as 
described herein, and those genes identified with a p value of < 0.05 as between the patients 

15 with Cardiovascular Disease as compared with patients without Cardiovascular Disease are 
shown in Table 6B. 

Classification or class prediction of a test sample from an unknown patient in order to 
diagnose said individual with Cardiovascular Disease can be done using the differentially 
expressed genes as shown in Table 6B in combination with well known statistical algorithms 
20 for class prediction as would be understood by a person skilled in the art and is described 
herein 



25 Nearologlcal Diseases 

This example demonstrates the use of the claimed invention to identify biomarkers of 
Neurological Disease and use of same. 

As used herein "Neurological Disease" is defined as a disorder of the nervous 
system, and include disorders that uivolve the central nervous system (brain, brainstem and 
30 cerebellum), the peripheral narvous system (including cranial nerves), and the autonomic 
nervous system (parts of which are located in both central and peripheral nervous system). In 
particular neurological disease includes alzheimers', schizophrenia, and manic depression 
syndrome. In each case, the diagnosis of Neurological Disease was corroborated by a 
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skilled Board certified physician. Hybridizations to create said RNA expression profiles 
were done using the Affymetrix® GeneChip® platfonns (U133A and U133 Plus 2.0) 
platforms as described herem (data not shown). Various experiments were performed as 
outlined above, and analyzed using either the Wilcox Mann Whitney rank sum test or other 

5 statistical tests as described herein, and those genes identified with a p value of < 0.05 as 
between the patients with neurological Disease as compared with patients without 
neurological Disease are shown in Table 6C. 

Classification or class prediction of a test sample firom an unknown patient in order to 
diagnose said individual with Neurological Disease can be done using the differentially 

10 expressed genes as shown in Table 6C in combination with well known statistical algorithms 
for class prediction as would be understood by a person skilled in the art and is described 
herein 



15 EXAMPLE 6 

In addition to methods to identify biomarkers that are associated with a specific group 
of diseases or conditions, another aspect of this invention includes methods to identify 
biomarkers that are associated with the administration of a specific drug or exogenous 
substance, or a specific grouping of drugs or exogenous substances thereof. In essence this 

20 aspect of the invention provides a method of providing an individuals drug signature. The 
administration of the exogenous substance(s) or drug(s) can be via any route and the instant 
methods of identifying these markers can be applied at any specifies time point(s) after said 
administration. The following examples illustrate embodiments of this drug signature aspect 
of the invention, but the invention is not limited to the methods comprising the drug(s) and 

25 exogenous substance(s), or groups of drugs amd exogenous substances illustrated below. 

Celebrex*^ 

Celebrex versus other COXinhihitorsi 

This example demonstrates the use of the claimed invention to identify biomarkers 
30 associated with Celebrex^ and use of same. 

This example demonstrates the use of the claimed invention to detect differential gene 
expression in Whole blood samples taken fi-om individuals who have been adminstered with 
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Celebrex^ as compared to Whole blood samples taken ftom individuals who have been 
adminstered with any Cox inhibitor except Celebrex^. 

As used herein "Cox Inhibitor^ ' is defined as anti-inflammatory drug that covalently 
modifies cyclooxygenases (Cox). RNA expression profiles &om individuals who have been 
adminstered with Celebrex^ were analyzed and compared to profiles &om individuals who 
have been adminstered with any Cox mhibitor except Celebrex^ Preferably healthy 
individuals are chosen who are age and sex matched to said individuals being compared- 
Total mRN A fi-om a blood sample is taken fi-om each individual and isolated using TRIzolC© 
reagent (GEBCO) and fluorescently labelled probes for each blood sample is generated as 
described above Hybridizations to create said RNA expression profiles were done using the 
Affymetrix® GeneChip® platforms (U133A and U133 Plus 2.0) platforms as described 
herein (data not shown). Various experiments were performed as outlined above, and 
analyzed using either the Wilcox Mann Whitney rank sum test or other statistical tests as 
described herein. Identification of genes differentially expressed in Whole blood samples 
from individuals who have been adminstered with Celebrex^ as compared to individuals 
who have been adminstered with any Cox inhibitor except Celebrex^ is determined by 
statistical analysis usmg the Wilcox Mann Whitney rank sum test using either the Wilcox 
Mann Whitney rank sum test or other statistical tests as described herein. Those differentially 
expressed genes identified with a p value of < 0.05 as between the individuals who have 
been adminstered with Celebrex^ as compared to individuals who have been adminstered 
with any Cox inhibitor except Celebrex^ are shown in Table 7A. 

Celebrex versus no Celebrex: 

This example demonstrates the use of the claimed invention to detect differential gene 
expression in Whole blood samples taken from individuals who have been adminstered with 
Celebrex^ as compared to Whole blood samples taken from individuals who have been not 
been adminstered with Celebrex^ RNA expression profiles from individuals who have been 
adminstered with Celebrex^ were analyzed and compared to profiles from individuals who 
have not been adminstered with Celebrex^ Preferably healtiiy individuals are chosen who 
are age and sex matched to said individuals being compared. Total mKNA from a blood 
sample is taken from each individual and isolated usmg TRIzol® reagent (GffiCO) and 
fluorescently labelled probes for each blood sample is generated as described above 
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Hybridizations to create said RNA expression profiles were done using the Affymetrix® 
GeneChip® platfoms (U133A and U133 Plus 2.0) platforms as described herein (data not 
shown). Various experiments were performed as outlined above, and analyzed using either 
the Wilcox Mann Whitney rank sum test or other statistical tests as described herein. 

5 Identification of genes differentially expressed in Whole blood samples from mdividuals 
who have been adminstered \yith Celebrex^ as compared to individuals who have been not 
been adminstered with Celebrex^ is determined by statistical analysis using the Wilcox 
Mann Whitney rank sum test using either the Wilcox Mann Whitney rank sum test or oflier 
statistical tests as described herein. Those differentially expressed genes identified with ap 

10 value of < 0.05 as between the individuals who have been adminstered with Celebrex^ as 
compared to individuals who have not been adminstered with Celebrex^ are shown in Table 
7B. 



Vioxx^ 

15 Viox3 ^ versus no Vioxy^i 

This example demonstrates the use of the claimed invention to detect differential gene 
expression in Whole blood samples taken from individuals who have been adminstered with 
Vioxx^ as compared to Whole blood samples taken from individuals who have been not 
been adminstered withVioxx^. RNA expression profiles from individuals who have been 

20 adminstered with Vioxx^ were analyzed and compared to profiles from individuals who have 
not been adminstered with Vioxx^. Preferably healthy individuals are chosen who are age 
and sex matched to said individuals being compared. Total mRNA from a blood sample is 
taken from each individual and isolated using TRIzol® reagent (GIBCO) and fluorescently 
labelled probes for each blood sample is generated as described above. Hybridizations to 

25 create said RNA expression profiles were done using the Affymetrix® GeneChip® platforms 
(U133A and U133 Plus 2.0) platforms as described herein (data not shown). Various 
experiments were performed as outlined above, and analyzed using either the Wilcox Mann 
Whitney rank sum test or otiier statistical tests as described herein. Identification of genes 
differentially expressed m Whole blood samples from mdividuals who have been 

30 adminstered with Vioxx^ as compared to individuals who have been not been adminstered 
with Vioxx^ is determined by statistical analysis using the Wilcox Mann Whitney rank sum 
test using either the Wilcox Mann Whitney rank sum test or other statistical tests as described 
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herein. Those differentiaUy expressed genes identified with a p value of < 0.05 as between 
the individuals who have been adminstered with Vioxx*^ as compared to individuals who 
have not been adminstered with Vioxx'^ are shown in Table 7C. 



5 Vioxx^ versus other COX inhibitors 

This example demonstrates the use of the claimed invention to detect differential gene 
expression in Whole blood samples taken ftom individuals who have been adminstered with 
Vioxx^ as compared to Whole blood samples taken fix)m individuals who have been 
adminstered with any Cox inhibitor except Vioxx^ 

10 KNA expression profiles fi-om individuals who have been adminstered with Vioxx 

were analyzed and compared to profiles torn individuals who have beaa adminstered with 
any Cox inhibitor except Vioxx^. Preferably healthy individuals are chosen who are age and 
sex matched to said individuals being compared. Total mRNA firom a blood sample is taken 
from each individual and isolated using TRIzol® reagent (GBCO) and fluorescently labelled 

15 probes for each blood sample is generated as described above. Hybridizations to create said 
RNA expression profiles were done using the Affymetrix® GeneChip® platforms (U133A 
and U133 Plus 2.0) platforms as described herein (data not shown). Various experiments 
were perfonned as outlined above, and analyzed using either the Wilcox Mann Whitney rank 
sum test or other statistical tests as described herein.. Identification of genes differentially 

20 expressed in Whole blood samples fixnn individuals who have been adminstered with 
Vioxx^ as compared to individuals who have-been adminstered with any Cox inhibitor 
except Vioxx'^ is determined by statistical analysis using the Wilcox Mann Whitney rank sum 
test using either the Wilcox Mann Whitney rank sum test or other statistical tests as described 
herein. Those differentially expressed genes identified with a p value of < 0.05 as between 

25 Ae individuals who have been adminstered with Vioxx^ as compared to mdividuals who 
have been admmstered with any Cox mhibitor except Vioxx"^ are shown m Table 7D. 

Non-steroidal anti-inflammatory agents (NSAIDs) 

30 This example demonstrates the use of the claimed invention to detect differential gene 

expression in Whole blood samples taken from individuals who have been adminstered wifli 
non-steroidal anti-inflammatory agents as compared to Whole blood samples taken from 
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individuals who have been not been adminstered with non-steroidal anti-inflammatory 
agents. As defined herein, non-steroidal anti-inflanmiatory agents are defined as a large 
group of anti-inflammatory agents that work by inhibiting the production of prostaglandins. 
They exert anti-inflammatory, analgesic and antipyretic actions and include: ibuprofen, 
5 ketoprofen, piroxicam, naproxen, sulindac, aspirin, choline subsalicylate, diflunisal, 

fenoprofen, indomethacin, meclofenamate, salsalate, tohnetin and magnesium salicylate. Not 
icluded are steroidal compounds (such as hydrocortisone or prednisone) exerting anti- 
inflammatory activity. RNA expression profiles ftom individuals who have been adminstered 
with non-steroidal anti-inflammatory agents were analyzed and compared to profiles firom 
10 individuals who have not been adminstered with non-steroidal anti-inflammatory agents. 
Preferably healthy mdividuals are chosen who are age and sex matched to said individuals 
being compared. Total mRNA from a blood sample is taken &om each individual and 
isolated using TRI^ol® reagent (GIBCO) and fluorescently labeUed probes for each blood 
sample is generated as described above. Hybridizations to create said RNA expression 
15 profiles were done usmg the Affymetrix® GeneChip® platforms (U133A and U133 Plus 
2.0) platforms as described herein (data not shown). Various experiments were performed as 
outlined above, and analyzed using either the Wilcox Mann Whitney rank sum test or other 
statistical tests as described herein. Identification of genes differentially expressed in Whole 
blood samples from individuals who have been adminstered withnon-steroidal anti- 
20 inflammatory agents as compared to individuals who have been not been adminstered with 
non-steroidal anti-inflammatory agents is determined by statistical analysis using the Wilcox 
Mann Whitney rank sum test using either the Wilcox Mann Whitney rank sum test or other 
statistical tests as described herein. Those differentially expressed genes identified with a p 
value of < 0.05 as between tiie mdividuals who have been adminstered with non-steroidal 
25 anti-inflammatory agents as compared to individuals who have not been adminstered with 
non-steroidal anti-inflammatory agents are shown in Table 7E. 

Cortisone 

This example demonstrates the use of the claimed invention to detect differential gene 
30 expression in Whole blood samples taken from individuals who have been adminstered witii 
Cortisone as compared to Whole blood samples taken from individuals who have been not 
been adminstered witii Cortisone. RNA expression profiles from individuals who have been 
adminstered with Cortisone were analyzed and compared to profiles from individuals who 
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have not been adminstered witii Cortisone. Preferably healthy individuals are chosen who 
are age and sex matched to said individuals being compared. Total mKNA from a blood 
sample is taken from each individual and isolated using TRIzol(i> reagent (GIBCO) and 
fluorescently labelled probes for each blood sample is generated as described above. 

5 Hybridizations to create said KNA expression profiles were done using the AfEymetrix® 
GeneChip®^platfonns(U133A and U133 Plus 2.0) platforms (U133A and U133 Plus 2.0) as 
described herem (data not shown). Various e3q)eriments were perfijrmed as outlined above, 
and analyzed using either the Wilcox Mann Whitney rank sum test or oflier statistical tests as 
described herein.. Identification of genes differentially expressed in Whole blood samples 

10 from individuals who have been adminstered with Cortisone as compared to individuals who 
have been not been adminstered with Cortisone is determined by statistical analysis using the 
Wilcox Mann Whitney rank sum test using either the Wilcox Mann Whitney rank sum test or 
other statistical tests as described herem. Those differentially expressed genes idraitified with 
a p value of < 0.05 as between the individuals who have been adminstered wifli Cortisone 

15 as compared to individuals who have not been adminstered with Cortisone are shown in 
Table 7F. 



Visco Supplement 

This example demonstrates the use of the claimed mvention to detect differential gene 
20 expression in Whole blood samples taken from individuals who have been adminstered with 
Visco Supplement as compared to Whole blood samples taken from individuals who have 
been not been adminstered with Visco Supplement. RNA expression profiles from 
individuals who have been adminstered with Visco Supplement were analyzed and compared 
to profiles from individuals who have not been adminstered with Visco Supplement. 
25 Preferably healthy individuals are chosen who are age and sex matched to said individuals 
being compared. Total mRNA from a blood sample is taken from each individual and 
isolated using TRIzol® reagent (GIBCO) and fluorescently labelled probes for each blood 
sample is generated as described above. Hybridizations to create said RNA expression 
profiles were done using the Affymetrix® GeneChip® platforms platforms (U133A and 
30 U133 Plus 2.0) as described herein (data not shown). Various experiments were performed 
as outlined above, and analyzed using either the Wilcox Mann Whitney rank sum test or other 
statistical tests as described herein. Idraitification of genes differentially expressed in Whole 
blood samples from individuals who have been adminstered with Visco Supplement as 
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compared to individuals who have he&a not been adminstered with Visco Supplement is 
detennined by statistical analysis using the Wilcox Mann Whitaey rank sum test using either 
the Wilcox Mann Whitney rank sum test or other statistical tests as described herein. Those 
differentially expressed genes identified with a p value of < 0*05 as between the individuals 
5 who have been adminstered with Visco Supplement as compared to mdividuals who have 
not been adminstered with Visco Supplement are shown in Table 7G. 

Lipitor 

This example demonstrates the use of the claimed invention to detect differential gene 
expression in Whole blood samples taken from individuals who have been adminstered with 

10 Lipitor as compared to Whole blood samples taken from individuals who have been not been 
adminstered with Lipitor. RNA expression profiles from individuals who have been 
adminstered with Lipitor were analyzed and compared to profiles from mdividuals who have 
not been adminstered with Lipitor. Preferably healthy individuals are chosen who are age 
and sex matched to said individuals being compared. Total mRNA from a blood sample is 

15 taken from each mdividual and isolated using TRIzol® reagent (GIBCO) and fluorescently 
labelled probes for each blood sample is generated as described above. HybricKzations to 
create said RNA expression profiles were done using the Affymetrix® GeneChip® platforms 
(U133A and 11133 Plus 2.0) platforms as described herein (data not shown). Various 
experiments were performed as outlined above, and analyzed using either the Wilcox Mann 

20 Whitney rank sum test or other statistical tests as described herein. Identification of genes 
differentially expressed in Whole blood samples from individuals who have been 
adminstered with Lipitor as compared to individuals who have been not been adminstered 
with Lipitor is determined by statistical analysis using the Wilcox Mann Whitney rank sum 
test using either tiie Wilcox Mann Whitney rank sum test or other statistical tests as described 

25 herein. Those differentially expressed genes identified with a p value of < 0.05 as between 
the individuals who have been adminstered with Lipitor as compared to individuals who 
have not been adminstered with Lipitor are shown in Table 7H. 

Smoking 

This example demonstrates the use of the claimed invention to detect differential gene 
30 expression in Whole blood samples taken from individuals who have smoked cigarettes and 
cigars as compared to Whole blood samples taken from individuals who have not smoked 
cigarettes and cigars . RNA expression profiles from individuals who have smoked were 
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analyzed and compared to profiles fixjm individuals who have not smoked. Preferably 
healthy individuals are chosen who are age and sex matched to said individuals being 
compared. Total mRNA fi-om a blood sample is taken firom each individual and isolated 
using TRl2X)l® reagent (GIBCO) and fluorescently labelled probes for each blood sample is 
generated as described above. Hybridizations to create said RNA expression profiles were 
done using the Afiymetrix®GeneChip® platforms (U133A and U133 Plus 2.0) platforms 
as described herein (data not shown). Various experiments were performed as outlined 
above, and analyzed using either the Wilcox Mann Whitney rank sum test or other statistical 
tests as described herem. Identification of genes diflferentially expressed in Whole blood 
samples firom individuals who have smoked as compared to individuals who have not 
smoked is determined by statistical analysis using the Wilcox Mann Whitney rank sum test 
using either the Wilcox Mann Whitney rank sum test or other statistical tests as described 
herein. Those differentially expressed genes identified with a p value of < 0.05 as between 
the individuals who have smoked as compared to individuals who have not smoked are 
shown in Table 71. 

Example 7. 

Identification of Genes Specific for OA Only by Removing Genes Relevant to Co- 
Morbidities and Other Disease States, 

This example demonstrates the use of the claimed invention to detect differential gene 
expression m blood unique to Osteoarthritis as compared with other disease states. 

Whole blood samples were taken fix>m patients who were diagnosed with mild OA or 
severe OA and compared with individuals who were identified as normal individuals as 
defined herem. RNA expression profiles were then analysed to identify genes which are 
differentially expressed in OA as compared with normal. In each case, the diagnosis of OA 
was corroborated by a qualified physician. 

Total mRNA firom a blood sample taken firom each patient was isolated using 
TRIzol® reagent (GIBCO) and fluorescentiy labeled probes for each blood sample were 
generated as described above. Each probe was denatured and hybridized to a ChondroChip™ 
as described herein. Identification of genes differentially expressed in Whole blood samples 
firom patients with mild or severe OA as compared to healthy patients was determined by 
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Statistical analysis using the Weltch ANOVA test QMichelson and Schofield, 1996). 
(Dendogram analysis not shown). 

In order to identify genes differentially expressed in blood unique to OA but not 
differentially expressed as a result of possible co-morbidities including hypertension, obesity, 

5 asthma, taking systemic steroids, or allergies, genes identified as differentially expressed in 
both OA and any of the genes identified as differentially expressed as a result of co- 
morbidity, e.g.. Table lA (co-morbidity of OA and hypertension v. normal), Table IB (co- 
morbidity of OA and obesity v. normal), Table 3C (co-morbidity of OA and allergy v. 
normal), Table 3D (co-morbidity of OA and taking systemic steroids v. normal), and genes in 

10 common with people identified as having asthma and OA (Table 3 AA) were removed. 
Sunilarly any genes and unique to obesity (Table 3R), hypertension (Table 3P), allergies 
(Table 3T), systemic steroids (Table 3V) were also removed. As a result of these 
comparisons, a list of genes unique to individuals with OA were identified. The identity of 
the differentially expressed genes is shown in Table SAB, 

15 It would be clear to a person skilled in the art that rather than simply remove those 

genes which are relevant to other disease states, one could use a more refined analysis and 
remove those genes which show the same trend in gene expression, e.g. remove those genes 
which show up regulation in a co-morbid state and also show up-regulation in the single 
disease state, but retain those genes which show a different trend in gene expression e.g. 

20 retain those genes which show up regulation in a co-naoibid state as compared to down 
regulation in a single disease state. 

Classification or class prediction of a test sample of an individual to determine 
whether said individual has OA or does not have OA can be done using the differentially 
expressed genes as shown in Table 3 AB, irrespective of whether the individual presents with 
25 co-morbidity using well known statistical algorithms as would be understood by a person 
skilled in the art and described herein. Commercially available programs such as those 
provided by Silicon Genetics (e.g. GeneSpringT^O for Class Predication are also available. 

Brain Cancer 

Analysis of RNA expression profiles of Whole blood samples from individuals 
30 having brain cancer as compared with RNA expression profiles firom normal individuals. 
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This example demonstrates the use of the claimed invention to detect differential gene 
expression in Whole blood samples taken from patients diagnosed with brain cancer as 
compared to Whole blood samples taken from healthy patients. 

As used herein "brain cancer" refers to all forms of primary brain tumors, both 
5 intracranial and extracranial and includes one or more of the following: Glioblastoma, 

Ependymoma, Gliomas, Astrocytoma, MeduUoblastoma, Neuroglioma, OUgodendroglioma, 
Meningioma, Retinoblastoma, and Craniopharyngioma, 

Whole blood samples are taken from patients diagnosed with brain cancer as defined 
herein. RNA expression profiles are then analysed and compared to profiles from patients 
10 unaffected by any disease. Preferably healthy patients are chosen who are age and sex 

matched to said patients diagnosed with disease. In each case, the diagnosis of brain cancer 
is corroborated by a skilled Board certified physician. 

Total mRNA firom a blood sample is taken from each patient and isolated using 
TKIzol® reagent (GIBCO) and fluorescently labeled probes for each blood sample are 

15 generated as described above. Each probe is denatured and hybridized to a Affymetrix 
U133A Chip and/or ChondroChip™ as described herem. Identification of genes 
differentially expressed in Whole blood samples from patients with brain cancer as compared 
to healthy patients is determined by statistical analysis using the Wilcox Maim Whitney rank 
sum test (Glantz SA. Primer of Biostatistics, 5th ed.. New York, USA: McGraw-Hill Medical 

20 Publishing Division, 2002). 

Classification or class prediction of a test sample of an individual to determine 
whether said individuals has brain cancer or does not having brain cancer can be done using 
the differentially expressed genes identified as described above as the predictor genes in 
combination with well known statistical algorithms as would be understood by a person 
25 skilled in the art and described herein. Commercially available programs such as those 
provided by Silicon Genetics (e.g. GeneSpring™) for Class Predication are also available. 

Prostate Cancer 

Analysis of RNA expression profiles of Whole blood samples from individuals 
having prostate cancer as compared with RNA expression profiles from normal mdividuals. 
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This exaiiq)le demonstrates the use of the claimed invention to detect differential gene 
ejq>ression in Whole blood samples taken from patients diagnosed with prostate cancer as 
compared to Whole blood samples iakisa from healthy patients 

As used herein "prostate cancer" refers to a malignant cancer originating within the 
5 prostate gland. Patients identified as having prostate cancer can have any stage of prostate 
cancer, as determined clinically (by digital rectal exam or PSA testing) and or pathologically. 
Staging of prostate cancer can done in accordance with TNM or the Staging System of the 
American Joint Committee on Cancer (AJCC). hi addition to the TNM system, other systems 
may be used to stage prostate cancer, for example, the Whitmore-Jewett system. 

10 Whole blood samples are taken from patients diagnosed with prostate cancer as 

defined herein. RNA expression profiles are then analysed and compared to profiles firom 
patients unaffected by any disease to identify genes which differentiate as between the two 
groups. Similarly RNA expression profiles can be analysed so as to differentiate as between 
the severity of the prostate cancer. Preferably healthy patients are chosen who are age and 

15 sex matched to said patients diagnosed with disease or with a specific stage of said disease. 
la each case, the diagnosis of prostate cancer is corroborated by a skilled Board certified 
physician. 

Total inRNA from a blood sample is taken from each patient and isolated using 
TRIzol® reagent (GIBCO) and fluorescently labeled probes for each blood sample is 

20 generated as described above. Each probe is denatured and hybridized to a Aftymetrix 
U133A Chip and/or a ChondroChip™ as described herein. Identification of genes 
differentially expressed in Whole blood samples from patients with prostate cancer as 
compared to healthy patients is determined by statistical analysis using the Wilcox Mann 
Whitiiey rank sum test (Glantz SA. Primer of Biostatistics. 5tii ed. New York, USA: 

25 McGraw-Hill Medical Publishing Division, 2002). 

Classification or class prediction of a test sample of an individual to determine 
whether said individuals has prostate cancer, has a specific stage of prostate cancer, or does 
not having prostate cancer can be done using the differentially expressed genes identified as 
described above as the predictor genes in combination with well known statistical algorithms 
30 as would be understood by a person skilled in the art and described herein. Commaxsially 
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available programs such as those provided by Silicon Genetics (e.g. GeneSpring™) for Class 
Predication are also available. 



Ovarian Cancer 

Analysis of RNA expression profiles of Whole blood samples firom individuals 
5 having ovarian cancer as compared with RNA expression profiles from normal individuals. 

This example demonstrates the use of the claimed invention to detect differential gene 
expression in Whole blood samples taken from patients diagnosed with ovarian cancer as 
compared to Whole blood samples taken from healthy patients. 

As used herein "ovarian cancer" refers to a malignant cancerous growth originating 
10 within the ovaries. Patients identified as having ovarian cancer can have any stage of ovarian 
cancer. Staging is done by combining information from imaging tests with the results of a 
surgical exammation done during a laprotomy. Numbered stages I to IV are used to describe 
the extent of tiie cancer and whether it has spread (metastasized) to more distant organs. 

Whole blood samples are taken &om patients diagnosed with ovarian cancer, or with a 
15 specific stage of ovarian cancer as defined herein. RNA expression profiles are then analysed 
and compared to profiles from patients unaffected by any disease. Preferably healthy patients 
are chosen who are age and sex matched to said patients diagnosed with disease or with a 
specific stage of said disease. In each case, the diagnosis of ovarian cancer is corroborated by 
a skilled Board certified physician. 

20 Total mRNA from a blood sample is taken from each patient and isolated using 

TRIzol® reagent (GIBCO) and fluorescently labeled probes for each blood sample is 
generated as described above. Each probe is denatured and hybridized to a Affymetrix 
U133A Chip and/or a ChondroChip™ as described herein. Identification of genes 
differentially expressed in Whole blood samples from patients with ovarian cancer and or a 

25 specific stage of ovarian cancer as compared to healthy patients is determined by statistical 
analysis using the Wilcox Mann Whitney rank sum test (Glantz SA. Primer of Biostatistics. 
5th ed. New York, USA: McGraw-Hill Medical Publishing Division, 2002). 

Classification or class prediction of a test sample of an individual to determine whether said 
individuals has ovarian cancer, has a specific stage of ovarian cancer or does not having 
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ovarian cancer can be done using the differentially expressed genes identified as described 
above as the predictor genes in combination with well known statistical algorithms as 
would be understood by a person skilled in the art and described herein. Commercially 
available programs such as those provided by Silicon Genetics (e.g. GeneSpring™) for 
5 Class Predication are also available. 

Gastric Cancer 

Analysis of RNA expression profiles of Whole blood samples firom individuals 
having gastric cancer as compared wifli RNA expression profiles &om normal individuals. 

This example demonstrates the use of the claimed invention to detect differential gene 
10 expression in Whole blood samples taken firom patients diagnosed with gastric cancer as 
compared to Whole blood samples taken from healthy patients. 

As used herein "gastric or stomach cancer" refers to a cancerous growth originating 
within the stomach and includes gastric adenocarcinoma, primary gastric lymphoma and 
gastric nonlymphoid sarcoma. Patients identified as having stomac can also be categorized 
15 by stage of said cancer as determined by the System of the American Joint Committee on 
Cancer (AJCC). 

Whole blood samples are taken firom patients diagnosed with stomach cancer, or with 
a specific stage of stomach cancer as defined herein. KNA expression profiles are then 
analysed and compared to profiles from patients unaffected by any disease. Preferably 
20 healthy patients are chosen who are age and sex matched to said patients diagnosed with 
disease or with a specific stage of said disease. In each case, the diagnosis of stomach cancer 
is corroborated by a skilled Board certified physician. 

Total mRNA from a blood sample is taken from each patient and isolated using 
TRIzol® reagent (GIBCO) and fluorescently labeled probes for each blood sample is 

25 generated as described above. Each probe is denatured and hybridized to a Affymetrix 
U133A Chip and/or a ChondroChip^^ as described herein. Identification of genes 
differentially expressed in Whole blood samples from patients with stomach cancer and or a 
specific stage of stomach cancer as compared to healthy patients is determined by statistical 
analysis usmg the Wilcox Mann Whitney rank sum test (Glantz SA, Primer of Biostatistics, 

30 5th ed.. New York, USA: McGraw-Hill Medical Publishing Division, 2002). 
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Classification or class prediction of a test sample of an individual to determine 
whether said individuals has stomach cancer, has a specific stage of stomach cancer or does 
not having stomach cancer can be done using the differentially expressed genes identified 
as described above as the predictor genes m combination with well known statistical 
5 algorithms as would be understood by a person skilled in the art and described herein. 
Commercially available programs such as those provided by Silicon Genetics (e.g. 
GeneSpring™) for Class Predication are also available. 

Breast Cancer 

Analysis of RNA expression profiles of Whole blood samples fiom individuals 
10 having breast cancer as compared with RNA expression profiles &om normal individuals. 

This example demonstrates the use of the claimed invention to detect differential gene 
expression in Whole blood samples taken fiom patients diagnosed with breast cancer as 
compared to Whole blood samples taken fiom healthy patients. 

As used herein "breast cancer*' refers to a cancerous growth originating within the 
15 breast and includes invasive and non invasive breast cancer such as ductal carcinoma in situ 
(DCIS), lobular carcinoma in situ (LCIS), infiltrating ductal carcinoma, and infiltrating 
lobular carcinoma. Patients identified as having breast cancer can also be categorized by 
stage of said cancer as determined by the System of the American Joint Committee on Cancer 
(AJCC) or TNM classification. 

20 Whole blood samples are taken from patients diagnosed with breast cancer, or with a 

specific stage of breast cancer as defmed herein. RNA expression profiles are then analysed 
and compared to profiles from patients unaffected by any disease. Preferably healthy patients 
are chosen who are age and sex matched to said patients diagnosed with disease or with a 
specific stage of said disease. In each case, the diagnosis of breast cancer is corroborated by 

25 a skilled Board certified physician. 

Total mRNA from a blood sample is taken from each patient and isolated using 
TRIzol® reagent (GIBCO) and fluorescently labeled probes for each blood sample is 
generated as described above. Each probe is denatured and hybridized to a Affymetrix 
U133A Chip and/or a ChondroChip™ as described herein. Identification of genes 
30 differentially expressed in Whole blood samples from patients with breast cancer and or a 
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specific stage of breast cancer as compared to healthy patients is determined by statistical 
analysis using the Wilcox Mann Whitney rank sum test (Glantz SA, Primer of Biostatistics, 
5th ed., New York, USA: McGraw-Hill Medical Publishing Division, 2002). 

Classification or class prediction of a test sample of an individual to determine 
5 whether said individuals has breast cancer, has a specific stage of breast cancer or does not 
have breast cancer can be done using the differentially expressed genes identified as 
described above as the predictor genes in combination with well known statistical 
algorithms as would be understood by a person skilled in the art and described herein. 
Commercially available programs such as those provided by Silicon Genetics (e.g. 
10 GeneSpring™) for Class Predication are also available. 

Nasopharyngeal Cancer 

Analysis of RNA expression profiles of Whole blood samples firom individuals 
having nasopharyngeal cancer as compared with RNA expression profiles from normal 
individuals. 

15 ' This example demonstrates the use of the claimed invention to detect differential gene 
expression in Whole blood samples taken from patients diagnosed with nasopharyngeal 
cancer as compared to Whole blood samples taken from healthy patients. 

As used herein "nasopharyngeal cancer" refers to a cancerous growth arising from the 
epithelial cells that cover the surface and line the nasopharynx. Patients identified as having 
20 nasopharyngeal cancer can also be categorized by stage of said cancer as determined by the 
System of the American Joint Committee on Cancer (AJCC) or TNM classification. 

Whole blood samples are taken from patients diagnosed with nasopharyngeal cancer, 
or with a specific stage of nasopharyngeal cancer as defined herein. RNA expression profiles 
etre then analysed and compared to profiles from patients unaffected by any disease. 
25 Preferably healthy patients are chosen who are age and sex matched to said patients 

diagnosed with disease or with a specific stage of said disease. In each case, the diagnosis of 
nasopharyngeal cancer is corroborated by a skilled Board certified physician. 

Total mRNA from a blood sample is taken from each patient and isolated usmg 
TRIzol® reagent (GIBCO) and fluorescently labeled probes for each blood sample is 
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generated as described above. Each probe is denatured and hybridized to a Affymetrix 
U133A Chip and/or a ChondroChip™ as described herein. Identification of genes 
differentially expressed in Whole blood samples firom patients with nasopharyngeal cancer 
and or a specific stage of breast cancer as compared to healthy patients is determined by 
5 statistical analysis using the Wilcox Mann Whitney rank sum test (Glantz S A, Primer of 
' Biostatistics, 5th ed., New York, USA: McGraw-ffill Medical PubUshing Division, 2002). 

Classification or class prediction of a test sample of an individual to determine 
whether said individuals has nasopharyngeal cancer, has a specific stage of nasopharyngeal 
cancer or does not have nasopharyngeal cancer can be done using the differentially 
10 expressed genes identified as described above as the predictor genes in combination with 
well known statistical algorithms as would be understood by a person skilled in the art and 
described herein. Commercially available programs such as those provided by Silicon 
Genetics (e,g. GeneSpring™) for Class Predication are also available. 

Guillain Barre syndrome 

15 Analysis of KNA expression profiles of Whole blood samples firom individuals 

having Guillain Barre syndrome as compared with RNA expression profiles firom normal 
individuals. 

This example demonstrates the use of the claimed invention to detect diflFerential gene 
expression in Whole blood samples taken firom patients diagnosed with Guillain Barre 
20 syndrome as compared to Whole blood samples taken from healthy patients. 

As used herein "Guillain Barre syndrome" refers to an acute, usually rapidly 
progressive form of inflammatory polyneuropathy characterized by muscular weakness and 
mild distal sensory loss. 

Whole blood samples are taken from patients diagnosed with Guillain Barre 
25 syndrome as defined herein. RNA expression profiles are then analysed and compared to 
profiles from patients unaffected by any disease. Preferably healthy patients are chosen who 
are age and sex matched to said patients diagnosed with disease. In each case, the diagnosis 
of Guillain Barre syndrome is corroborated by a skilled Board certified physician. 
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Total mRNA from a blood sample is taken from each patient and isolated using 
TRizol® reagent (GIBCO) and fluorescently labeled probes for each blood sample is 
generated as described above. Each probe is denatured and hybridized to a Afiymetrix 
U133A Chip and/or a OiondroChip™ as described herein. Identification of genes 
differentially expressed in Whole blood samples from patients with Guillain Barre syndrome 
as compared to healthy patients is determined by statistical analysis using the Wilcox Mann 
Whitney rank sum test (Glantz SA, Primer of Biostatistics, 5th ed.. New Yoric, USA: 
McGraw-Hill Medical Publishing Division, 2002). 

Classification or class prediction of a test sample of an individual to determine 
whether said individuals has Guillain Barre syndrome, or does not have Guillain Barre 
syndrome can be done using the diflferentially expressed genes identified as described 
above as the predictor genes in combination with weU known statistical algorithms as 
would be understood by a person skilled in the art and described herein. Commercially 
available programs such as those provided by SiUcon Genetics (e.g. GeneSpring™) for 
Class Predication are also available. 

Fibromyalgia 

Analysis of RNA expression profiles of Whole blood samples from individuals 
having Fibromyalgia as compared with RNA expression profiles from normal individuals. 

This example demonstrates tiie use of tiie claimed invention to detect differential gene 
expression in Whole blood samples taken from patients diagnosed with Fibromyalgia as 
compared to Whole blood samples taken fixMn healthy patients. 

As used herein "Fibromyalgia" refers to widespread chronic musculoskeletal pain and 
fetigue. The pain comes from the connective tissues, such as the muscles, tendons, and 
ligaments and does not involve the joints. Whole blood samples are taken &om patients 
diagnosed with Fibromyalgia as defined herein. RNA expression profiles are then analysed 
and compared to profiles from patients unaffected by any disease. Preferably healthy patients 
are chosen who are age and sex matched to said patients diagnosed with disease. In each 
case, the diagnosis of Fibromyalgia is corroborated by a skiUed Board certified physician. 

Total mRNA from a blood san^le is taken from each patient and isolated usmg 
TRIzol® reagent (GIBCO) and fluorescentty labeled probes for each blood sample is 
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generated as described above. Each probe is denatured and hybridized to a AflEymetrix 
U133A Chip and/or a ChondroChip™ as described herem. Identification of genes 
differentially expressed in Whole blood samples fcom patients with Fibromyalgia as 
compared to healthy patients is deteraiined by statistical analysis using the Wilcox Mann 
5 Whitney rank sum test (Glantz SA. Primer of Biostatistics. 5th ed. New York, USA: 
McGraw-Hill Medical Publishing Division, 2002). 

Classification or class prediction of a test sample of an individual to determine 
whether said individuals has Fibromyalgia, or does not have Fibromyalgia can be done 
using the differentially expressed genes identified as described above as the predictor genes 
10 in combmation with well known statistical algorithms as would be understood by a person 
skilled in the art and described herein. Commercially available programs such as those 
provided by Silicon Genetics (e.g. GeneSpring™) for Class Predication are also available. 

Multiple Sclerosis 

Analysis of KNA expression profiles of Whole blood samples firom individuals 
15 having Multiple Sclerosis as compared with RNA expression profiles fix)m normal 
individuals. 

This example demonstrates the use of the claimed invention to detect differential gene 
expression in Whole blood samples taken &om patients diagnosed with Multiple Sclerosis as 
compared to Whole blood samples taken firom healthy patients. 

20 As used herem "Multiple Sclerosis" refers to chronic progressive nervous disorder 

involving the loss of myelin sheath surrounding certain nerve fibres. Whole blood samples 
are taken fiom patients diagnosed with Multiple Sclerosis as defined herein. RNA expression 
profiles are then analysed and compared to profiles firom patients unaffected by any disease. 
Preferably healthy patients are chosen who are age and sex matched to said patients 

25 diagnosed vnth disease. In each case, the diagnosis of Multiple Sclerosis is corroborated by a 
skilled Board certified physician. 

Total mRNA firom a blood sample is taken firom each patient and isolated using 
TRIzol® reagent (GBBCO) and fluorescently labeled probes for each blood sample is 
generated as described above. Each probe is denatured and hybridized to a Affymetrix 
30 U133A Chip and/or a ChondroChip™ as described herem. Identification of genes 
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differentially expressed in Whole blood samples from patients with Multiple Sclerosis as 
compared to healthy patients is determined by statistical analysis using the Wilcox Mann 
Whitney rank sum test (Glantz SA, Primer of Biostatistics, 5th ed., New York, USA: 
McGraw-Hill Medical Publishing Division, 2002). 

5 Classification or class prediction of a test sample of an individual to determine 

whether said individuals has Multiple Sclerosis, or does not have Multiple Sclerosis can be 
done using the differentially expressed genes identified as described above as the predictor 
genes in combination with well known statistical algorithms as would be understood by a 
person skilled in the art and described herein. Commercially available programs such as 
10 those provided by Silicon Genetics (e.g. GeneSpring™) for Class Predication are also 
available. 

Muscular Dystrophy 

Analysis of RNA expression profiles of Whole blood samples fi:om individuals 
having Muscular Dj^trophy as compared witti RNA e7q)ression profiles from normal 
15 individuals. 

This example demonstrates the use of the claimed invention to detect dififerential gene 
expression in Whole blood samples taken from patients diagnosed with Muscular Dystrophy 
as compared to Whole blood samples taken fix>m healthy patients. 

As used herein "Muscular Dystrophy" refers to a hereditary disease of the muscular 
20 system characterized by weakness and wasting of the skeletal muscles. Muscular Dystrophy 
includes Duchennes' Muscular Dystrophy, limb-girdle muscular dystrophy, myotonia 
atrophica, myotonic muscular dystrophy, pseudohypertrophic muscular dystrophy, and 
Steinhardt's disease. 

Whole blood samples are taken from patients diagnosed with Muscular Dystrophy as 
25 defined herein, RNA expression profiles are then analysed and compared to profiles from 
patients unaffected by any disease. Preferably healthy patients are chosen who are age and 
sex matched to said patients diagnosed with disease. In each case, the diagnosis of Muscular 
Dystrophy is corroborated by a skilled Board certified physician. 
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Total mRNA from a blood sample is taken from each patient and isolated using 
TRizol® reagent (GBBCO) and fluorescently labeled probes for each blood sample is 
generated as described above. Each probe is denatured and hybridized to a Affymetrix 
U133A Chip and/or a ChondroChip™ as described 'herein. Identification of genes 
5 diffCTentially expressed m Whole blood samples from patients with Muscular Dystrophy as 
compared to healthy patients is determined by statistical analysis using the Wilcox Mann 
Whitney rank sum test (Glantz SA,. Primer of Biostatistics, 5th ed.. New York, USA: 
McGraw-Hill Medical Publishing Division, 2002). 

Classification or class prediction of a test sample of an individual to determine 
10 whether said individuals has Muscular Dystrophy, or does not have Muscular Dystrophy 
can be done using the differentially expressed genes identified as described above as the 
predictor genes in combination with well known statistical algorithms as would be 
understood by a person skilled in the art and described herein. Conmiercially available 
programs such as those provided by SiUcon Genetics (e.g. GeneSpring™) for Class 
15 Predication are also available. 

Septic Joint Arthroplasty 

Analysis of RNA expression profiles of Whole blood samples from individuals 
having septic joint arthroplasty as compared with RNA expression profiles from normal 
individuals. 

20 This example demonstrates the use of the claimed invention to detect differential gene 

expression in Whole blood samples taken from patients diagnosed with septic joint 
arthroplasty as compared to Whole blood samples taken from healthy patients. 

As used herein "septic joint arthroplasty" refers to an inflanunation of the joint caused 
by a bacterial infection. 

25 Whole blood samples are taken from patients diagnosed with septic joint arthroplasty 

as defined herein. RNA expression profiles are tiien analysed and compared to profiles from 
patients unaffected by any disease. Preferably healthy patients are chosen who are age and 
sex matched to said patients diagnosed with disease. In each case, the diagnosis of septic 
joint arthroplasty is corroborated by a skilled Board certified physician. 
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Total mRNA ftom a blood sample is taken &om each patient and isolated using 
TRIzol® reagent (GIBCO) and fluorescently labeled probes for eadi blood sample is 
generated as described above. Each probe is denatured and hybridized to a Affymetrix 
U133A Chip and/or a OiondroCMp™ as described herein. I dentification of genes 
5 differentially expressed in Whole blood samples from patients with septic joint arthroplasty 
as compared to healthy patients is determined by statistical analysis using tiie Wilcox Mann 
Whitney rank sum test (Glantz SA, Primer of Biostatistics, 5th ed., New York, USA: 
McGraw-Hill Medical Pubhshing Division, 2002). 

Classification or class prediction of a test sample of an individual to detamine 
10 whether said individuals has septic joint arthroplasty, or does not have septic joint 
arthroplasty can be done using the differentially expressed genes identified as described 
above as Ihe predictor genes in combination with well known statistical algorithms as would 
be understood by a person skilled in the art and described herein. Commercially available 
programs such as those provided by SiKcon Genetics (e.g. GeneSpring™) for Class 
IS Predication are also available. 

Hepatitis 

Analysis of RNA expression profiles of Whole blood samples fi'om individuals 
having hepatitis as compared with RNA expression profiles fixjm normal individuals. 

This example demonstrates the use of the claimed invention to detect gene expression 
20 in Whole blood samples taken fix>m patients diagnosed with hepatitis as compared to Whole 
blood samples taken fix>m healthy patients. As used herein "hepatitis" refers to an 
inflammation of the liver caused by a virus or toxin and can include h^atitis A, hepatitis B, 
hepatitis C, hepatitis D, hepatitis E, and hepatitis F. Whole blood samples are taken firom 
patients diagnosed with hepatitis as defined herein. KNA expression profiles are then 
25 analysed and compared to profiles firom patients unaffected by any disease. Preferably 
healthy patients are chosen who are age and sex matched to said patients diagnosed with 
disease. In each case, the diagnosis of hepatitis is corroborated by a skilled Board certified 
physician.Total mRNA firom a blood sample is taken from each patient and isolated using 
TRIzol® reagent (GIBCO) and fluorescentiy labeled probes for each blood sample is 
30 generated as described above. Each probe is denatured and hybridized to a Affymetrix 
U133A Chip and/or a ChondroChip™ as described herein. Identification of genes 
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differentially expressed in Whole blood samples from patients with hepatitis as compared to 
healthy patients is detennined by statistical analysis using the Wilcox Mann Whitney rank 
sum test (Glantz SA, Primer of Biostatistics, 5th ed., New York, USA: McGraw-Hill Medical 
Publishing Division, 2002). 

5 Classification or class prediction of a test sample of an individual to determine 

whether said individuals has hepatitis, or does not have hepatitis can be done using the 
differentially expressed genes identified as described above as the predictor genes in 
combination with well known statistical algorithms as would be understood by a person 
skilled in the art and described hereiiL Commercially available programs such as those 

.0 provided by Silicon Genetics (e.g. GeneSpring™) for Class Predication are also available. 

Malignant Hyperthermia Susceptibility 

Analysis of RNA expression profiles of Whole blood samples from individuals 
having Malignant Hyperthermia Susceptibility as compared with RNA expression profiles 
from normal individuals. 

15 This example demonstrates tiie use of the claimed invention to detect differential gene 

expression m Whole blood samples taken from patients diagnosed with Malignant 
HyperthOTnia Susceptibility as compared to Whole blood samples taken from healthy 
patients. As used herein "Malignant Hyperthermia Susceptibility*' refers to a 
pharmacogenetic disorder of skeletal muscle calcium regulation often developing during or 

20 after a general anaesthesia. 

Whole blood samples are taken from patients diagnosed with Malignant Hyperthermia 
SusceptibiUty as defined herein. RNA expression profiles are tiien analysed and compared to 
profiles from patients unaffected by any disease. Preferably healthy patients are chosen who 
are age and sex matched to said patients diagnosed with disease. In each case, the diagnosis 

25 of Malignant Hypertheraiia Susceptibility is corroborated by a skilled Board certified 
physician.Total mRNA from a blood sample is taken from each patient and isolated using 
TRIzol® reagent (GIBCO) and fluorescently labeled probes for each blood sample is 
generated as described above. Each probe is denatured and hybridized to a Affymetrix 
U133A Chip and/or a ChondroChip™ as described herein. Identification of genes 

30 differentially expressed in Whole blood samples from patients with Malignant Hyperthermia 
Susceptibility as compared to healthy patients is determined by statistical analysis using the 
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Wilcox Mann Whitney rank sum test (Glantz SA, Primer of Biostatistics, 5th ed.. New York, 
USA: McGraw-Hill Medical Publishing Division, 2002). 

Classijacation or class prediction of a test sample of an individual to determine 
whether said individuals has MaUgnant Hyperthermia SusceptibiUty, or does not have 
5 Mahgnant Hyperthermia SusceptibiUty can be done using the differentially expressed genes 
identified as described above as the predictor genes in combination with well known 
statistical algorithms as would be understood by a person skilled in the art and described 
herein. Commercially avaUable programs such as those provided by SiUcon Genetics (e.g. 
GeneSpring™) for Class Predication are also available. 

.0 Osteoarthritic Horses 

Analysis of RNA repression profiles of Whole blood samples &om. horses having 
osteoarthritis as compared with RNA expression profiles &om normal or non-osteoarthritic 
horses. 

This example demonstrates the use of the claimed invention to detect differential gene 
15 expression in Whole blood samples taken 6om horses so as to diagnose equine arthritis as 
compared to Whole blood samples taken firom healthy horses. 

As used herein "arthritis" in reference to horses refers to a degenerative joint disease 
that affects horses by causing lameness. Although it can appear in any joint, most common 
areas are the upper knee joint, firont fetlocks, hocks, or coffin joints in the firont feet. The 

20 condition can be caused by trauma, mmeral or dietary deficiency, old age, poor conformation, 
over exertion or infection. The different structures that can be damaged in arthritis are the 
cartilage inside joints, the bone in the jomts, the joint capsule, the synovial membranes, the 
ligaments around the joints and lastly Ae fluid that lubricates the insides of 'synovial joints'. 
In severe cases all of these stmctures are affected. In for example osteochondrosis only the 

25 cartilage may be affected. 

Regardless of the cause, the disease begins when the synovial fluid that lubricates 
healthy joints begins to thin. The decrease in lubrication causes the cartilage cushion to break 
down, and eventually the bones begin to grind painfully against each other. Diagnostic tests 
used to confirm arthritis include X-rays, joint fluid analysis, and ultrasound. 
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Whole blood samples are taken from horses diagaosed with arthritis as defined herein. 
RNA expression profiles are then analysed and compared to profiles from horses unaJffected 
by any disease. Preferably healthy horses are chosen who are age and sex matched to said 
horses diagnosed with disease. la each case, the diagnosis of arthritis is corroborated by a 
5 certified vet^inarian. 

Total mRNA from a blood sample is taken from each horse and isolated using 
TRfeol® reagent (GIBCO) and fluorescently labeled probes for each blood sample is 
generated as described above. Each probe is denatured and hybridized to a Affymetrix 
U133A Chip and/or a ChondroChip™ as described herein. An equine specific microarray 
10 representing the equine genome can also be used. Identification of genes differentially 

expressed in Whole blood samples from horses with arthritis as compared to healthy horses is 
determined by statistical analysis using the Wilcox Mann Whitney rank-sum test (Glantz SA, 
Primer of Biostatistics, 5th ed. New York, USA: McGraw-ffiU Medical PubUshing Division, 
2002). 

15 Classification or class prediction of a test sample of a horse to determine whether 

said horse has arthritis or does not have arthritis can be done using the differentially 
expressed genes identified as described above as the predictor genes in combination with 
well known statistical algorithms as would be understood by a person skilled m the art and 
described herem. Commercially available programs such as those provided by Silicon 

20 Genetics (e.g. GeneSpring™) for Class Predication are also available. 

Osteoarthritic Dogs 

Analysis of RNA expression profiles of Whole blood samples from dogs having 
osteoarthritis as compared with RNA expression profiles from normal or non-osteoarthritic 
dogs. 

25 This example demonstrates the use of the claimed invention to detect differential gene 

expression in Whole blood samples taken from dogs so as to diagnose equine arthritis as 
compared to Whole blood samples taken &om healthy horses. 

As used herein "osteoarthritis" in reference to dogs is a form of degenerative joint 
disease which involves the deterioration of and changes to the cartilage and bone, hi 
30 response to inflammation in and about the joint, the body responds with bony remodeling 
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around the joint structure. This process can be slow and gradual with minimal outward 
symptoms, or more r^idly progressive with significant pain and discomfort. Osteoarthritic 
changes can occur in response to infection and injury of the joint as well. 

Whole blood samples are taken j&om dogs diagnosed with osteoarthritis as defined 
5 herein. RNA expression profiles are then analysed and compared to profiles fi-om dogs 
unaffected by any disease. Preforably healthy dogs are chosen who are age, sex and breed 
matched to said dogs diagnosed with disease. In each case, the diagnosis of osteoarthritis is 
coiToborated by a certified veterinarian. 

Total mKNA from a blood sample is taken firom each dog and isolated using TRIzol® 
.0 reagent (GEBCO) and fluorescently labeled probes for each blood sample is generated as 
described above. Each probe is denatured and hybridized to a Affymetrix U133A Chip 
and/or a ChondroChip™ as described herein. A canine specific microarray representing the 
canine genome can also be used. Identification of genes differentially expressed in Whole 
blood samples from dogs with osteoarthritis as compared to healthy horses is determined by 
15 statistical analysis using the Wilcox Mann Whitney rank stmi test (Glantz SA, Primer of 
Biostatistics, 5th ed.. New York, USA: McGraw-Hill Medical Publishing Division, 2002). 

Classification or class prediction of a test sample of a dog to determine whether said 
dog has osteoarthritis or does not have osteoarthritis can be done using the differentially 
expressed genes identified as described above as the predictor genes in combination with 
20 well known statistical algorithms as would be understood by a person skilled in the art and 
described herein. Commercially available programs such as those provided by Silicon 
Genetics (e.g. GeneSpring™) for Class Predication are also available. 

Manic Depression Syndrome (MDS) as compared with Schizophrenia RNA expression 
profiles 

25 Analysis of RNA expression profiles of Whole blood samples from individuals 

having Manic Depression Syndrome (MDS) as compared with RNA expression profiles from 
individuals having Schizophrenia. 

This example demonstrates the use of the claimed invention to detect differential gene 
expression in Whole blood samples taken from patients diagnosed with MDS as compared to 
30 Whole blood samples taken fcom schizophrenic patients. 
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As used herein "Manic Depression Syndrome (MDS)" refers to a mood disorder 
characterized by alternating mania and depression. As used herein, "schizophrenia" is 
defined as a psychotic disorders characterized by distortions of reality and disturbances of 
thought and language and withdrawal from social contact. Patients diagnosed with 

5 "schizophrenia" can include patients having any of the following diagnosis: an acute 

schizophrenic episode, borderline schizophrenia, catatonia, catatonic schizophrenia, catatonic 
type schizophrenia, disorganized schizophrenia, disorganized type schizophrenia, 
hebephrema, hebephrenic schizophrenia, latent schizophrenia, paranoic type schizophrenia, 
paranoid schizophrenia, paraphrenia, paraphrenic schizophrenia, psychosis, reactive 

.0 schizophrenia or the like. 

Whole blood samples are taken firom patients diagnosed with MDS or Schizophrenia 
as defined herein. RNA expression profiles are then analyzed and compared to profiles firom 
patients unaffected by any disease. Preferably healthy patients are chosen who are age and 
sex matched to said patients diagnosed with disease. In each case, the diagnosis of MDS and 
15 Schizophrenia is corroborated by a skilled Boardcertified physician. Total mRNA firom a 
blood sample is taken fi-om each patient and isolated using TRIzol"*" reagent (GIBCO) and 
fluorescently labeled probes for each blood sample is generated as described above. 

Each probe is denatured and hybridized to a Affymetrix U133A Chip and/or a 
ChondroChip(tm) as described herein. Identification of genes differentially expressed in 

20 Whole blood samples fi'om patients with MDS as compared to Schizophrenic patients as 
compared to normal individuals is determined by statistical analysis using the Wilcox Maim 
Whitney rank sum test (Glantz SA, Primer of Biostatistics, 5th ed.. New York, USA: 
McGraw-Hill Medical Publishing Division, 2002) (data not shown). 294 genes were 
identified as being differentially expressed with a p value of < 0.05 as between the 

25 schizophrenic patients, the MDS patients and those control individuals. The identity of the 
differentially expressed genes is shown in Table SAC. 

Classification or class prediction of a test sample of an individual to determine 
whether said individuals has MDS, has Schizophrenia or is normal can be done using the 
differentially expressed genes identified as described above as the predictor genes in 
30 combination with well known statistical algorithms as would be understood by a person 
skilled in the art and described herein. Commercially available programs such as those 
provided by Silicon Genetics (e.g. GeneSpring^*^^ for Class Predication are also available. 
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Prediction of Progression of Osteoarthritis. 

Analysis of RNA expression profiles of Whole blood samples of Individuals So as to 
5 Predict Progression of Osteoarthritis. 

This example demonstrates the use of the claimed mvention to predict the progression 
of Osteoarthritis. 

As used herein "osteoarthritis" is a form of degenetative joint disease which involves 
the deterioration of and changes to the cartilage and bone. In response to inflammation in and 
10 about the joint, the body responds with bony remodeling around the joint structure. This 
process can be slow and gradual with minimal outward symptoms, or more rapidly 
progressive with significant pain and discomfort. Osteoarthritic changes can occur in 
response to infection and injury of the joint as well. 

Whole blood samples are taken from test individuals not having any symptoms of 
15 osteoarthritis and RNA expression profiles are then analyzed and compared to profiles from 
individuals having mild osteoarthritis. 

Classification or class prediction of a test sample of said individual to determine 
whether said individual has mild osteoarthritis or does not have osteoarthritis can be done 
using the dififerentially expressed genes identified as described herein as the predictor genes 
20 in combination with well known statistical algorithms as would be understood by a person 
skilled in the art and described herein. Commercially available programs such as those 
provided by Silicon Genetics (e.g. GeneSpringTW) for Class Predication are also avaUable. 

Individuals identified with mild osteoarthritis as a result of this classification have a 
25 significantly greater chance of developing moderate, marked and/or severe osteoarthris than 
those individuals not diagnosed with mild osteoarthritis. 

Therapy 

Microarray Data Analysis of RNA expression profiles of Whole blood samples from 
individuals having a condition as compared with RNA expression profiles from individuals 
30 not having said condition, and wherein said individual is undergoing therapeutic treatment in 
light of said condition. 
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This example demonstrates the use of the claimed invention to detect differential gene 
expression in Whole blood samples taken from individuals undergoing therapeutic treatment 
of a condition as compared with RNA expression profiles from individuals not undergoing 
treatment. 

5 Whole blood samples are taken from patients who are undergoing therapeutic 

treatment. RNA expression profiles are then analysed and compared to profiles from patients 
not undergoing treatment 

Total mRNA from a blood sample takrai fi?om each patient is isolated using TRIzol® 
reagent (GIBCO) and fluorescenUy labeled probes for each blood sample are generated as 
10 described above. Each probe is denatured and hybridized to a microarray for example the 
15K Chondrogene Microarray Chip (ChondroChip™), Affymetrix Genechip or Blood chip as 
described herein. 

Identification of genes differentially expressed in Whole blood samples from patients 
undergoing therq)eutic treatment as conq>ared to patients not undergoing treatment is 
15 determined by statistical analysis using the Wilcox Mann Whitiiey rank sum test (Glantz SA. 
Primer of Biostatistics., differentially expressed genes are then identified as being 
differentially expressed with a p value of < 0.05. 

One skUled in tiie art will appreciate readily that the present invention is weU adapted 
to carry out the objects and obtain the ends and advantages mentioned, as well as those 
20 objects, ends and advantages inherent herein. 

The present examples, along witii the metiiods, procedures, treatinents, molecules, 
and qjecific compounds described herein are presentiy representative of preferred 
embodiments, are exemplary, and are not intended as limitations on the scope of the 
invention. 

25 Changes tiierein and other uses will occur to those skilled in the art which are 

encompassed within tiie spirit of the invention as defined by tiie scope of tiie claims. 

All patents, patent applications, and published references cited herein are hereby 
incorporated by reference in their entirety. While this invention has been particularly shown 
and described with references to preferred embodiments tiiereof, it will be understood by 
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those skilled in the art that various changes in fonn and details may be made therein without 
departing from the scope of the invention encompassed by the appended claims. 
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5 1 . A method of identifying one or more biomarkers for a disease of interest, wherein each of 
said one or more biomarkers corresponds to an KNA transcript, comprising the method 
steps of: 

a) determining the level of one or more RNA transcripts expressed in blood obtained 
jfrom one or more individuals having said disease of interest, wherein each of said one or 

10 more RNA transcripts is a candidate biomarker for said disease of interest; and 

b) comparing the level of each of said one or more RNA transcripts from said step a) 
with the level of each of said one or more RNA transcripts in blood obtained from one or 
more individuals not having said disease of interest, 

wherein the RNA transcripts which display differing levels in the comparison of step 
15 b), are identified as being biomarkers for said disease of interest, 

and/or 

c) detennining the level of one or more RNA transcripts expressed in blood obtained 
from one or more individuals having said disease of interest, wherein each of said one or 
more transcripts is a candidate biomarker for said disease of interest; and 

20 d) comparing the level of each of said one or more RNA transcripts from said step c) 

with the level of each of said one or more transcripts in blood obtained from one or more 
individuals having said disease of interest, 

wherein the RNA transcripts which display the same levels in the comparison of step 
d), are identified as being biomarkers for said disease of interest. 

25 

2. The method of claim 1, wherein the disease is selected &om the group consisting of: liver 
cancer, bladder cancer, brain cancer, prostate cancer, ovarian cancer, kidney cancer, gastric 
cancer, lung cancer, breast cancer, nasopharyngeal cancer, pancreatic cancer, osteoarthritis. 
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depression, hypertension, heart failure, obesity, rheumatoid arthritis, hyperlipidemia, lung 
disease, chagas disease, allergies, schizophrenia, asthma, manic depression syndrome, 
ankylosing spondylitis, guillainbarre syndrome, fibromyalgia, multiple sclerosis, muscular 
dystrophy, septic joint arthroplasty, hepatitis, Crohn's disease or colitis, or malignant 
hyperthermia susceptibility, psoriasis, thyroid disorder, irritable bowel syndrome, 
osteoporosis, migraines, eczema, or a heart murmer. 

3. A method of identifying one or more biomarkers of a stage of progression or regression 
of a disease of interest, wherein each of said one or more biomarkers corresponds to an 
RNA transcript, comprising the steps of: 

a) determining tiie level of one or more KNfA transcripts expressed in blood obtained 
from one or more individuals having a stage of said disease of interest, wherein said one 
or more individuals are at the same progressive or regressive stage of said disease of 
interest, and wherein each of said one or more transcripts is a candidate biomarker for 
determining the stage of progression or regression of said disease of interest, and; 

b) comparing the level of each of said one or more RNA transcripts from said step a) 
with the level of each of said one or more transcripts in blood obtained from one or more 
individuals who are at a progressive or regressive stage of said disease of interest distinct 
from that of said one or more individuals of step a), 

wherein those compared transcripts which display differing levels in the comparison 
- of step b) are identified as being biomarkers for tiie stage of progression or regression of 
said disease of interest, 

and/or 

c) determming the level of one or more RNA transcripts expressed in blood obtained 
from one or more individuals having a stage of said disease of interest, wherein said one or 
more individuals are at the same progressive or regressive stage of said disease of mterest, 
and wherein each of said one or more transcripts is a candidate biomarker for detennining the 
stage of progression or regression of said disease of interest, and; 

d) comparing the level of each of said one or more RNA transcripts from said step c) 
with the level of each of said one or more RNA transcripts in blood obtained from one or 
more individuals who are at a progressive or regressive stage of said disease of interest as 
that of said one or more individuals of step c), 

wherein those compared transcripts which display the same levels in the comparison 
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of Step d) are identified as being biomarkers for the stage of progression or regression of said 
disease of interest. 



4. The method of claim 3, wherein said disease of interest is selected firom the group 
5 consisting of: Hver cancer, bladder cancer, brain cancer, prostate cancer, ovarian cancer, 
kidney cancer, gastric cancer, lung cancer, breast cancer, nasopharyngeal cancer, pancreatic 
cancer, osteoarthritis, depression, hypertension, heart failure, obesity, rheumatoid arthritis, 
hyperlipidemia, lung disease, chagas disease, allergies, schizophrenia and asthma, manic 
depression syndrome, ankylosing spondylitis, guillain barre syndrome, fibromyalgia, multiple 
10 sclerosis, muscular dystrophy, septic joint arthroplasty, hepatitis, Crohn's disease or colitis, 
malignant hyperthermia susceptibility, psoriasis, tihiyroid disorder, irritable bowel syndrome, 
osteoporosis, migraines, eczema, or a heart murmer, alzheuner's, CAD, Diabetes, or 
colorectal cancer. 

15 5, A method of identifying one or more biomarkers for a condition of interest, wherem 
each of said one or more biomarkers corresponds to an RNA transcript, comprising the steps 
of: 

a) determining the level of one or more KNA transcripts expressed in blood obtained 
from one or more individuals having said condition of interest, wherein each of said one or 

20 more transcripts is a candidate biomarker for said condition of interest; and 

b) comparing the level of each of said one or more RNA transcripts from said step a) 
with the level of each of said one or more RNA transcripts in blood obtained from one or 
more individuals not having said condition of interest, 

wherein those compared transcripts which display differing levels in the comparison 
25 of step b) are identified as being biomarkers for said condition of interest, 

and/or 

c) determining the level of one or more RNA transcripts expressed in blood obtained 
fi^m one or more individuals having said condition of interest, wherein each of said one or 
more transcripts is a candidate biomarker for said condition of interest; and 

30 d) comparing the level of each of said one or more RNA transcripts from said step c) 
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with the level of each of said one or more RNA transcripts in blood obtained from one or 
more individuals having said condition of interest, 

wherein those compared transcripts which display the same levels in the comparison 
of step d) are identified as being biomarkers for said condition of interest. 

5 

6. The method of claim 5, wherein said condition of interest is the condition resulting 
from the administration of medication wherein said medication is selected from the group 
consisting of: Celebrex, Vioxx, NSAIDS, Cortozone, Hyaluronic Acid, Systemic Steroids, 
hormone replacement therapy, pregnazone and birth control pills. 

10 7. The method of claim 5, wherein said condition of interest is the condition resulting 
from exposure to an environmental condition, wherein said environmental condition is 
cigarette smoke. 

8. A method of identifying one or more biomarkers to differentiate between a pair of 
disease and/or conditions, wherein said pair consists of a first and a second disease or 

15 condition of interest, and wherein each of said one or more biomarkers corresponds to an 
RNA transcript, comprising the steps of: 

a) determming the level of one or more KNA transcripts expressed in blood obtained 
bom one or more individuals having said first disease or condition of interest while not 
having said second disease or condition of interest, wherein each of said one or more 

20 transcripts is a candidate biomaiker for differentiating said first disease or condition of 
intCTest from said second disease or condition of interest; and 

b) comparing the level of each of said one or more RNA transcripts from said step a) 
with the level of each of said one or more genes transcripts in blood obtained from one or 
more individuals having said second disease or condition of interest while not having said 

25 first disease or condition of interest, 

wherein those compared transcripts which display differing levels in the comparison 
of step b) are identified as being biomarkers for differentiating said first disease or condition 
of interest from said second disease or condition of interest. 

9. The method of claim 8, wherein said first and second disease or condition of interest of 
30 said pair of disease and/or conditions, are selected from the group consisting of rheumatoid 

arthritis, osteoarthritis, schizophrenia, manic depression syndrome, liver cancer, hepatitis, 
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bladder cancer, kidney cancer, bladder cancer, testicular cancer, pancreatic cancer, kidney 
cancer, liver cancer, stomach cancer, colon cancer, Chagas disease, heart failure, coronary 
artery disease, asymptomatic Chagas Disease, symptomatic Ch^as Disease, alzheimer's 
Disease, allergies, systemic steroids, allergy, Type n Diabetes, obesity, hypertension, , 
5 hyperUidemia, lung disease, bladder cancer, asthma, psoriasis, thyroid disorder, irritable 
bowel syndrome, osteoporosis, migraine headaches, excema, NASH, Crohn's colitis, chronic 
cholecystitis, cervical cancer, cardiovascular disease, and neurological disease. 

10. The method of any one of claims 1-2, wherein said disease of interest is hypertension 
and wherein said one or more RNA transcripts are transcribed from one or more genes 

10 selected from the group consisting of the genes Usted in Tables 1 A, IE, IP and IQ, or 

wherein said disease of interest is obesity and wherein said one or more RNA transcripts are 
transcribed from one or more genes selected from the group consisting of the genes Usted in 
Tables IB, IF, IR, and IS, or wherein said disease of interest is allergies and wherein said 
one or more RNA transcripts are transcribed from one or more genes selected from the group 

15 consisting of the genes listed in Tables IC, IT, and lU, or wherein said disease of interest is 
type n diabetes, and wherem said one or more RNA transcripts are transcribed from one or 
more genes selected from the group consisting of the genes listed in Table IG, and wherein 
said marker does not identify the insulin gene, or wherein said disease of interest is 
hyperlipidemia, and wherein said one or more RNA transcripts are transcribed from one or 

20 more genes selected from the group consisting of the genes listed in Table IH, or wherein 
said disease of interest is lung disease, and wherein said one or more RNA transcripts are 
transcribed from one or more genes selected from the group consisting of the genes listed in 
Tablell, or wherein said disease of interest is bladder cancer, and wherein said one or more 
RNA transcripts are transcribed from one or more genes selected from the group consisting of 

25 the genes listed in Tables IJ and IK, or wherein said disease of interest is coronary artery 
disease, and wherein said one or more RNA transcripts are transcribed from one or more 
genes selected from the group consisting of the genes listed in Table IL wherein said marker 
does not identify a gene selected from the group consisting of ANF, ZFP and DMyHC, or 
wherein said disease of interest is rheumatoid arthritis, and wherein said one or more RNA 

30 transcripts are transcribed from one or more genes selected from the group consisting of the 
genes listed in Table IM, or wherein said disease of interest is osteoarthritis, and wherein 
said one or more RNA transcripts are transcribed from one or more genes selected from the 
group consisting of the genes Usted in lAB, or wherein said disease of mterest is depression. 
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and wherein said one or more RNA transcripts are transcribed from one or more genes 
selected from the group consisting of the genes listed in Table IN, or wherein said disease of 
interest is liver cancer, and wherein said one or more RNA transcripts are transcribed from 
one or more genes selected from the group consisting of the gmes listed in Table IX, or 

5 wherein said disease of interest is chagas disease, and wherein said one or more RNA 
transcripts are transcribed from one or more genes selected from the group consisting of the 
genes hsted in Table IZ, or wherein said disease of interest is asthma, and wherein said one 
or more RNA transcripts are transcribed from one or more genes selected from the group 
consisting of the genes listed in Table lAA, or wherein said disease of interest is ankylosing 

10 spondylitis, and wherein said one or more RNA transcripts are transcribed from one or more 
genes selected from the group consisting of the genes listed in Table 1AM, wherein said 
disease of interest is manic depression, and wherein said one or more RNA transcripts are 
transcribed from one or more genes selected from the group consistmg of the genes listed in 
Table IAN, or wherein said disease of interest is alzheimers disease, and wherein said one or 

15 more RNA transcripts are transcribed from one or more genes selected from the group 
consisting of the genes hsted in Table 1 AO and wherein said marker does not identify the 
APP gene, or wherein said disease of interest is cervical cancer, and wherein said one or more 
RNA transcripts are transcribed from one or more genes selected from the group consisting of 
the genes listed in Table 1 AQ, or wherein said disease of interest is gastric cancer, and 

20 wherein said one or more RNA transcripts are transcribed from one or more genes selected 
from the group consisting of the genes listed in Table 1 AR, or wherein said disease of 
interest is kidney cancer, and wherein said one or more RNA transcripts are transcribed from 
one or more genes selected from the group consisting of the genes listed in Table IAS, or 
wherein said disease of interest is testicular cancer, and wherein said one or more RNA 

25 transcripts are transcribed from one or more genes selected from the group consisting of the 
genes Usted in Table 1 AT, or wherein said disease of interest is colon cancer, and wherein 
said one or more RNA transcripts are transcribed from one or more genes selected from the 
group consisting of the genes Usted in Table lAU, or wherein said disease of interest is heart 
failure, and wherem said one or more RNA transcripts are transcribed from one or more 

30 genes selected from the group consisting of the genes listed in Table lAV, or wherein said 
disease of interest is hepatitis, and wherein said one or more RNA transcripts are transcribed 
from one or more genes selected from the group consisting of the genes hsted in Table lAW, 
or wherein said disease of interest is either Crohn's disease or colitis, and wherein said one 
or more RNA transcripts are transcribed from one or more genes selected from the group 
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consisting of the genes listed in Table lAX, or wherein said disease of uiterest is 
osteoporosis, and wherein said one or more RNA transcripts are transcribed from one or more 
genes selected from the group consisting of the genes listed in Table lAY, or wherein said 
condition of interest is the taking of systemic steroids, and wherein said one or more RNA 
5 transcripts are transcribed from one or more genes selected from the group consisting of the 
genes listed in Tables ID, IV, 1 W, and IAD. 

1 1 . The method of any one of claims 5, 6 or 7, wherem said condition of interest is the 
administration of systemic steroids and said one or more RNA transcripts are transcribed 
from one or more genes selected from the group consisting of the genes listed in Tables ID, 

10 IV, IW and AD, wherein said condition of interest is the administration of Celebrex, and said 
one or more RNA transcripts are transcribed from one or more genes selected from the group 
consisting of the genes listed in Table 7B, wherein said condition of interest is the 
administration of Vioxx, and said one or more RNA transcripts are transcribed from one or 
more genes selected from the group consisting of the genes listed in Table 7C, wherein said 

15 condition of interest is the administration of NSAIDs, and said one or more RNA transcripts 
are transcribed from one or more genes selected from the group consisting of the graes listed 
in Table 7E, wherein said condition of interest is the administration of Cortisone, and said 
one or more RNA transcripts are transcribed from one or more genes selected from the group 
consisting of the genes listed in Table 7F, wherein said condition of mterest is the 

20 administration of Lipitor, and said one or more RNA transcripts are transcribed from one or 
more genes selected from the group consisting of the genes listed in Table 7H, and wherein 
said condition of interest is one of smokmg, and said one or more RNA transcripts are 
transcribed from one or more genes selected from the group consisting of the genes listed in 
Table 71. 

25 12. A method of identifying one or more biomarkers specific for a group of related 
diseases and/or conditions of interest, wherem said group of related diseases and/or 
conditions comprise those diseases and/or conditions which display a similar phenotype 
and/or which originate from and/or effect the same physiological system, or which have the 
same or similar etiology, or which are medically classified together, and wherein each of said 

30 one or more biomarkers corresponds to a gene transcript, comprising the steps of: 

a) determining the level of one or more RNA transcripts expressed in blood obtained 
from one or more individuals having a disease and/or condition that is encompassed by said 
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group of related diseases and/or conditions of interest, wherein each of said one or more 
transcripts is transcribed j&om a gene that is a candidate biomarker for said group of related 
diseases and/or conditions of interest , 

b) comparing the level of each of said one or more RNA transcripts from said step a) 
5 with the level of each of said one or more genes transcripts in blood obtained from one or 
more individuals not having a disease and/or condition that is encompassed by said group of 
related diseases and/or conditions of interest, 

wherein those compared transcripts which display differing levels in the comparison 
of step b) are identified as being biomarkers for identifying one or more biomarkers specific 
10 for said group of related diseases and/or conditions of interest. 

13. The method of claim 12, wherein said group of related diseases and/or conditions of 
interest is selected from the following groups of related diseases and/or conditions: cancer, 
cardiovascular disease and neurological disease. 

14. The method of claim 13 wherein the disease of cancer is consists of: cervical cancer, 
15 stomach cancer, kidney cancer, testicular cancer, bladder cancer, liver cancer, limg cancer 

and colon cancer 

15. The method of claim 13 wherein the cardiovascular disease and/or condition consists 
of coronary artery disease, heart failure and hypertension. 

16. The method of clahn 13, wherein said neurological disease and/or condition consists of 
20 alzheimer's disease. Manic Depression and 

Schizophrenia. 

17. A method of diagnosing or prognosing a condition of interest and/or a stage of 
progression or regression thereof, in an individual suspected as having said condition of 

25 interest, comprising the steps of: 

a) determining the level of one or more gene transcripts expressed in blood 
corresponding to the biomarkers identified in any one of claims 1 through 16 and claim 22, 
obtained from said individual, and 

b) comparing the level of each of said one or more gene transcripts in said blood 

30 according to step a) with the level of each of said one or more gene transcripts in blood from 
one or more individuals having said condition of interest, 
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c) comparing the level of each of said one or more gene transcripts in said blood 
according to step a) with the level of each of said one or more gene transcripts in blood from 
one or more individuals not having said condition, 

d) detennining whether the level of said one or more gene transcripts of step a) classify 
with the levels of said transcripts in step b) as compared with levels of said transcripts in step 
c), wherein said determination is indicative of said individual of step a) having said disease 
of interest. 

18. A kit comprising one or more biomarkers identified in any one of claims 1 through 16 and 
claim 22. 

19. A kit for diagnosing or prognosing a condition of interest comprising: a) two gene- 
specific priming means designed to produce double stranded DNA complementary to a 
transcript which correlates to one or more biomarkers identified in any one of claims 1 
through 16 and claim 22, wherein said first priming means contains a sequence which can 
hybridize to said RNA transcript to create an extension product and said second priming 
means capable of hybridizing to said extension product; b) an enzyme with reverse 
transcriptase activity c) an enzyme with thermostable DNA polymerase activity and d) a 
labeling means; wherein said primers are used to detect the quantitative expression levels of 
said RNA transcript(s> in a test subject. 

20. A kit for monitoring a course of therapeutic treatment of a condition of interest, 
comprising a) two gene-specific priming means designed to produce double stranded DNA 
complementary to a transcript which correlates to one or more biomarkers identified in any 
one of claims 1 through 16 and claim 22, wherein said first priming means contains a 
sequence which can hybridize to said RNA transcript to create an extension product and said 
second priming means enable of hybridizing to said extension product; b) an enzyme with 
reverse transcriptase activity c) an enzyme with thermostable DNA polymerase activity and 
d) a labeling means; wherem said primers are used to detect the quantitative expression levels 
of said RNA transcript(s) in a test subject. 

21 . A kit for monitoring progression or regressionof coronary artery disease, comprising: a) 
two gene-specific priming means designed to produce double stranded DNA complementary 
to a transcript which correlates to one or more biomarkers identified in any one of claintis 1 
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through 16 and claira 22, wherein said first priming means contains a sequence which can 
hybridize to said RNA transcript to create an extension product and said second priming 
means capable of hybridizing to said extension product; b) an enzyme with reverse 
transcriptase activity c) an enzyme with thermostable DNA polymerase activity and d) a 
5 labeling means; wherein said primers are used to detect the quantitative egression levels of 
said RNA tran5cript(s) in a test subject. 

22. The method of claim 9, wherein said first and second disease or condition of interest 
of said pair of disease and/or conditions, are selected fi:om the group consisting of 
rheumatoid arthritis and osteoarthritis, respectively, schizophrenia and manic depression 

10 syndrome, respectively, liver cancer and hepatitis,respectively, bladder cancer and kidney 
cancer, respectively, bladder cancer and testicular cancer, respectively, testicular cancer and 
kidney cancer, respectively, liver cancer and stomach cancer, respectively, liver cancer and 
colon cancer, respectively, stomach cancer and colon cancer, respectively, Chagas disease 
and heart failure, respectively, Chagas disease and coronary artery disease, respectively, 

15 coronary artery disease and heart failure, respectively, asymptomatic Chagas Disease and 
symptomatic Chagas Disease, respectively, Alzheimer's Disease and Schizophrenia, 
respectivley, and Alzheimer's Disease and Manic Depression, respectively. 
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Figure i 
Hypertension 
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Figure 2 
Obesity 
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Figures 4 
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Figure 5 
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Figure 6 
Obesily 
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Figure 7 
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Figure 8 intersecting 

Genes 
(Table 1 J) 




6/25 



wo 2004/112589 



PCT/US2004/020836 



Figure 9 intersecting 
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(Table 1L) 




7/25 



wo 2004/112589 



PCT/US2004/020836 



Figure 10 
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Figure 1 1 
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Diabetea 
Type2 
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Figure 13 

HyperiipiueLii 
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Lung Disease: 3 
Nan-lung Disease: 25 
Normal: 24 

Differentially Expressed Genes (p<0.05): 596 
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Bladder Cancer: 5 

Non-bladder Cancer 18 

Diffirentially Ejcpressed Genes (p<0.05): 4,228 
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Figure 18 
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Figure 19 
Depression 
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Figure 20 
Osteoarthritis 




Osteoarthritis: 9 mild, 8 moderate, 8 marked, 9 severe 
Nonnal: 9 

Differentially Expressed Genes (p<0,05): 300 
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Figure 21 
Liver Cancer 
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Figure 22 
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Figure 23 
Chagas Disease 
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Figure 24 
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Figure 25 



Schizophrenia, Batch 2 vs. MDS 
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Figure 26 
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